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RANDOM TRIANGLE REMOVAL 



TOM BOHMAN, ALAN FRIEZE, AND EYAL LUBETZKY 



Abstract. Starting from a complete graph on n vertices, repeatedly delete the edges of a uniformly 

chosen triangle. This stochastic process terminates once it arrives at a triangle-free graph, and the 

fundamental question is to estimate the final number of edges (equivalently, the time it takes the 

process to finish, or how many edge-disjoint triangles are packed via the random greedy algorithm). 

^vj BoUobas and Erdos (1990) conjectured that the expected final number of edges has order n''", 

7—i motivated by the study of the Ramsey number R{3,t). An upper bound of o{n^) was shown by 

^^ Spencer (1995) and independently by Rodl and Thoma (1996). Several bounds were given for 

CN variants and generalizations (e.g., Alon, Kim and Spencer (1997) and Wormald (1999)), while the 

rH best known upper bound for the original question of BoUobas and Erdos was ri^/*+°(^) due to 

'~{ Grable (1997). No nontrivial lower bound was available. 

^~5 Here we prove that with high probability the final number of edges in random triangle removal 

QQ is equal to n^'^'^°^^' , thus confirming the 3/2 exponent conjectured by BoUobas and Erdos and 

matching the predictions of Spencer et al. For the upper bound, for any fixed e > we construct a 

^~i family of exp(0(l/e)) graphs by gluing 0(l/e) triangles sequentially in a prescribed manner, and 

\^ dynamically track all homomorphisms from them, rooted at any two vertices, up to the point where 

r) n'^ "^^ edges remain. A system of martingales establishes concentration for these random variables 

around their analogous means in a random graph with corresponding edge density, and a key role 

is played by the self-correcting nature of the process. The lower bound builds on the estimates at 



C^ that very point to show that the process will typically terminate with at least n''" °''^' edges left. 



1. Introduction 



^^ Consider the following well-known stochastic process for generating a triangle-free graph, and at 

*K^ the same time creating a partial Steiner triple system. Start from a complete graph on n vertices 

^ and proceed to repeatedly remove the edges of uniformly chosen triangles. That is, letting G(0) 

C^l denote the initial graph, G{i + 1) is obtained from G{i) by selecting a triangle uniformly at random 

. out of all triangles in G{i) and deleting its 3 edges. The process terminates once no triangles 

^]^ remain, and the fundamental question is to estimate the stopping time 

CN To = min{i : G{i) is triangle-free} . 

• • This is equivalent to estimating the number of edges in the final triangle-free graph, since G{i) has 

• J^ precisely (2) — 3z edges by definition. As the triangles removed are mutually edge-disjoint, this 

/\^ process is precisely the random greedy algorithm for triangle packing. 

BoUobas and Erdos (1990) conjectured that the expected number of edges in G(to) has order 
n^'^ (see, e.g., l6l[7|), with the motivation of determining the Ramsey number R{3,t). Behind this 
conjecture was the intuition that the graph G{i) should be similar to an Erdos- Renyi random graph 
with the same edge density. Indeed, in the latter random graph with n vertices and en^'^ edges 
there are typically about nS^n^''^ triangles, thus, for small e, deleting all of its triangles one by one 
would still retain all but a negligible fraction of the edges. 

It was shown by Spencer ll4i in 1995, and independently by Rodl and Thoma [13] in 1996, that 



the final number of edges is 0(71^) with high probability (w.h.p.). In 1997, Grable 11 improved 
this to an upper bound of t^h/s+oII) w.h.p., and further described how similar arguments, using 
some more delicate calculations, should extend that result to 'n7/^+°y^). This remained the best 
upper bound prior to this work. No nontrivial lower bound was available. (See [10] for numerical 
simulations firmly supporting an answer of n^''^~^"^^J to this problem.) 
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Of the various works studying generalizations and variants of the problem, we mention two here. 
In his paper from 1999, Wormald [15] demonstrated how the differential equation method can 
yield a nontrivial upper bound on greedy packing of hyperedges in /c-uniform hypergraphs. For the 
special case k = 3, corresponding to triangle packing, this translated to a bound of n^~57+°(^). Also 
in the context of hypergraphs, Alon, Kim and Spencer [l] introduced in 1997 a semi-random variant 
of the aforementioned process (akin to the Rodl nibble [l2J yet with some key differences) which, 
they showed, finds nearly perfect matchings. Specialized to our setting, that process would result in 
a collection of edge-disjoint triangles on n vertices that covers all but 0(n^" log ' n) of the edges 
of the complete graph. Alon et al [I] then conjectured that the simple random greedy algorithm 
should match those results, and in particular — generalizing the BoUobas-Erdos conjecture — that 
applying it to find a maximal collection of A:-tuples with pairwise intersections at most k — 2 would 
leave out an expected number of at most n '^-i °^ ' uncovered {k — l)-tuples. They added that 
"at the moment we cannot prove that this is the case even for k = 3" , the focus of our work here. 

Joel Spencer offered $200 for a proof that the answer to the problem is n^/'^~^°W w.h.p. ( 
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The main result in this work establishes this precise statement, thus confirming the exponent 
conjectured by Bollobas and Erdos (1990). 

Theorem 1. Letro be the number of steps it takes the random triangle removal process to terminate 
starting from a complete graph on n vertices, and let E{tq) be the edge set of the final triangle-free 
graph. Then with high probability tq = n^/6 — n^""*""'^-*, or equivalently, |-E(ro)| = n^''^^°W _ 

We prove Theorem [ij by showing that w.h.p. all variables in a collection of e'^'^'^-'n^ random 
variables, carefully designed to support the analysis, stay close to their respective trajectories 



throughout the evolution of this stochastic hypergraph process (see ^1.1 for details). Establishing 
concentration for this large collection of variables hinges on their self-correction nature: the further 
a variable deviates from its trajectory, the stronger its drift is back toward its mean. However, 
turning this into a rigorous proof is quite challenging given that the various variables interact and 
errors (deviations from the mean) propagating from other variables may interfere in the attempt 
of one variable to correct itself. We construct a system of martingales to guarantee that the drift 
of a variable towards its mean will dominate the errors in our estimates for its peers. 

The tools developed here for proving such self-correcting estimates are generic and we believe 
they will find applications in other settings. In particular, these methods should support an analysis 
of the random greedy hypergraph matching process as well as that of the asymptotic final number 



of edges in the graph produced by the triangle-free process (see ^1.2). Progress on the latter process 



would likely yield improvement on the best known lower bound on the Ramsey number R{3,t). 

1.1. Methods. Our starting point for the upper bound is a system of martingales for dynamically 
tracking an ensemble of random variables consisting of the triangle count and all co-degrees in 
the graph. The self-correcting behavior of these variables is roughly seen as follows: should the 
co-degree of a given pair of vertices deviate above/below its average, then more/fewer than average 
triangles could shrink the co-neighborhood if selected for removal in the next round, and this 
compensation effect would eventually drive the variable back towards its mean. One can exploit 
this effect to maintain the concentration of all these variables around their analogous means in a 
corresponding Erdos- Renyi random graph, as long as the number of edges remains above n ''^~^°^^' . 
(In the short note |4| the authors applied this argument to match this upper bound due to Grable.) 
It is no coincidence that various methods break precisely at the exponent 7/4 as it corresponds 
to the inherent barrier where co-degrees become comparable to the variations in their values that 
arose earlier in the process. In order to carry out the analysis beyond this barrier, one can for 
instance enrich the ensemble of tracked variables to address the second level neighborhoods of pairs 
of vertices. This way one can avoid cumulative worst-case individual errors due to large summations 
of co-degree variables, en route to co-degree estimates which improve as the process evolves. Indeed, 
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Figure 1. A subset of the ensemble of about 2'^'^ triangular ladders that are tracked for any 
pair of roots to sustain the analysis of the upper bound to the point of v?''^^'^ edges for e = j^. 



these ideas can push this framework to successfully track the process to the point where rc'l'^^°^^' 
edges remain. However, beyond that point (corresponding to an edge density of about n~^''^) the 
size of common neighborhoods of triples would become negligible, foiling the analysis. This example 
shows the benefit of maintaining control over a large family of subgraphs, yet at the same time it 
demonstrates how for any family of hounded size subgraphs the framework will eventually collapse. 

In order to prove the rc'i'^^°y^i upper bound we track an arbitrarily large collection of random 
variables. A natural prerequisite for understanding the evolution of the number of homomorphisms 
from any small subgraph H to G{i) rooted at a given pair of vertices u, v is to have ample control 
over homomorphisms from subgraphs obtained by "gluing a triangle" to an edge to H (as the 
number of those dictates the probability of losing copies of H in the next round on account of 
losing the mentioned edge). Tracking these would involve additional larger subgraphs etc., and to 
end this cascade and obtain a system of variables which can self-sustain itself we do the following. 
To estimate the probability of losing the "last edge" in the largest tracked H, in lieu of counting 
homomorphisms from a "forward extension" (the result of gluing a new triangle onto the given 
edge) rooted at u,v, we fix our candidate for the edge in G{i) that would play the role of this 
last edge and then count homomorphisms rooted at u, v and featuring this specific edge. In other 
words, we replace the requirement for control over "forward extensions" with control over "backward 
extensions" from any given edge in G{i). Obviously, rooting the subgraphs at as many as 4 vertices 
can drastically decrease the expected number of homomorphisms, say to 0(1), while our analysis 
must involve error probabilities that tend to at a super-polynomial rate to account for our 
polynomial number of variables. 

These points lead to a careful definition of the ensemble of graphs we wish to track. We construct 
what we refer to as triangular ladders which, roughly put, are formed by repeatedly taking an 
existing element in the ensemble and gluing a triangle on one of its last added two edges. We do so 
until reaching a certain threshold on the number of vertices, namely 0(l/e) for some fixed e > 0, 
amounting to a family of 2'^^^''^^ graphs (see FigurejlJ). Special care needs to be taken so as to avoid 
certain substructures that would foil our analysis, yet ignoring these for the moment (postponing 
precise definitions to Q, note that each new step of adding a triangle (a new vertex and two new 
edges) also increases the density of the ladder under consideration, as long as the edge density is 
at least n"^"'*'^. When examining a given ladder in this ensemble, its forward extensions will also 
be in the ensemble, by construction, unless we exceed the size threshold. In the latter case, if the 
threshold is taken to be suitably large, we can safely appeal to a backward extension argument and 
wind up with polynomially many homomorphisms to G{i). 
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The main challenge in this approach is to come up with a canonical definition of the ensemble 
such that a uniform analysis could be applied to its arbitrarily many elements. In particular, since 
as usual the errors in one variable propagate to others, we end up with a system of linear constraints 
on 2^^^''^' variables. Fortunately, suitable canonical definitions of the main term in each variable 
as a function of its predecessor in the ensemble make this constraint system one without cyclic 
dependencies, allowing for a simple solution to it. Altogether, the triangular ladder ensemble can 
be maintained as long as there are 71^'^+'^ edges, and while so it sustains the analysis of the number 
of triangles. 

The lower bound builds on the fact that our analysis of the upper bound yielded asymptotic 
estimates for all co-degrees up to the point of n^' ^^^ edges, for arbitrarily small e > 0. Already a 
weaker bound on the co-degrees, when combined with an analysis of the evolution of the number of 
triangles, suffices to show that either the final number of edges has order at least n^'^ or at some 
point i there are \E{i)\ x re^'^ edges and (| — o{l))\E{i)\ edge-disjoint triangles. These give rise to 
a subset of n^'^~^^^ edge-disjoint triangles that existed earlier in the process, such that each one 
could guarantee an edge to survive in the final graph with probability n~^^'^\ and these events are 
mutually independent. Altogether this leads to at least n^"~ *^^' edges in the final graph w.h.p. 

1.2. Comparison with the triangle-free process. A different recipe for obtaining a random 
triangle-free graph is the so-called "triangle-free process". In that process, the (2) edges arrive 
one by one according to a uniformly chosen permutation, and each one is added if and only if no 
triangles are formed with it in the current graph. It was shown in Is] that the final number of edges 
in this process is w.h.p. n^i'^~^°W ^ and the correct order of n^'^-v/Iogn, along with its ramifications 
on the Ramsey number R{3,t), was recovered in [^ (see also [5] for generalizations). 

Similarly to that process, the triangle removal process studied here can be presented as taking a 
uniform ordering of the (3) triangles, then sequentially removing the edges of a triangle if and only 
if all 3 edges presently belong to the graph. The main result in this work shows that this process 
too culminates in a triangle-free graph with n^/'^~^°^^) edges. 

Despite the high level similarity between the two protocols, the triangle removal process has 
proven quite challenging already at the level of acquiring the correct exponent of the final number 
of edges. One easily seen difference is that, in the triangle- free process, at any given point there 
is a set of forbidden edges (whose addition would form at least one triangle) , yet regardless of its 
structure the next edge to be added is uniformly distributed over the legal ones. As long as the 
forbidden set of edges is negligible compared to the remaining legal edges, this process therefore 
mostly adds uniform edges just as in the Erdos-Renyi random graph. In particular, this is the case 
as long as o{n^''^) edges were added (even based on all the edges that arrived rather than those 
selected) . A coupling to the Erdos-Renyi random graph up to that point immediately gives a lower 
bound of order n^'^ on the expected number of edges at the end of the triangle-free process. One 
could hope for an analogous soft argument to hold for the triangle removal process, where it would 
translate to an upper bound (as it deletes rather than adds edges). 

However, in the triangle removal process studied here, deleting almost all edges save for n^/'^^°W 
forms a substantial challenge for the analysis. Already when the edge density is a small constant, 
still bounded away from 0, the set of forbidden triangles becomes much larger than the set of legal 
ones. Thus, in an atteinpt to analyze the process even up to a density of n~^ for some e > 0, one is 
forced to control the geometry of the remaining triangles quite delicately from its very beginning. 

2. Upper bound on the final number of edges modulo co-degree estimates 

Let (J^i) be the filtration given by the process. Let Nu = Nu{i) = {x £ Vg '■ xu £ E{i)} denote 
the neighborhood of a vertex u G Vg, let Nu,v = NuCiNtj be the co-neighborhood of vertices u,v £ Vg 
and let Yu^v = |-/Vu,^| denote their co-degree. Our goal is to estimate the number of triangles in 
G{i) which we denote by Q(i). The motivation behind tracking co-degrees is immediate from the 
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fact that Q{i) = | '^uv&EU) '^u,v, yet another important element is their effect on the evolution of 
Q. By definition, 

K[AQ\J^i] = - J2 iyu,v + Yu,u^ + Y,^^-2)/Q, (2.1) 

uvw^Q 

where here and in what follows AX = X{i + 1) — X{i) is the one-step change in the variable X. 
It is convenient to re-scale the number of steps i and introduce a notion of edge density as follows: 

t = t{i) = ^, p = p{i,n) = l-^ = l-6t. (2.2) 

With this notation we have 

\E{^)\= (^^ -3i= (^^ -^{l-p)n^ = ^{n^p-n), (2.3) 

and we see that, up to the negligible linear term, the number of edges corresponding to the edge 
density p is in line with that of the Erdos-Renyi random graph Q{n,p). Under the assumption that 
G{i) indeed resembles G{n,p) we expect that the number of triangles and co-degrees would satisfy 

Yu,v ~ np'^ and Q ^ -n^p^ . 



Turning to the evolution of the co-degrees, we see that 

+ n 

Q 



E[Ay„,j J-,] = - y: ^"'^^^^';, ^^'"'"'^K (2.4) 



xeN, 



u.v 



and so tight estimates on the co-degrees and Q can already support the analysis of the process. In 
the short note £4] the authors relied on these variables alone to establish a bound of 0{n^'^ log ' n) 
on the number of edges that survive to the conclusion of the algorithm. However, once the edge 
density drops to p = n~^'^ we arrive at Yu^v ^ n^'"^. As the variables Yu^v have order n and standard 
deviations of order ^/n while p is constant, we see that at the point p = n~^'^ the deviations that 
developed early in the process cease to be regarded as negligible and a refined analysis is required. 

One can achieve better precision for the co-degrees by introducing additional variables aimed 
at decreasing the variation for Y^^v as the process evolves. For instance, through estimates on 
the number of edges between the common and exclusive neighborhoods for every pair of vertices, 
along with some additional ingredients exploiting the self-correction phenomenon, one can gain an 
improvement on the 7/4 exponent. However, eventually these arguments break down, soon after 
some bounded-size subgraphs becomes too rare, thus foiling the concentration estimates. 

We remedy this via the triangular ladder ensemble, defined in ^and analyzed in |5J To sustain 
the analysis up to the point where n^/'^'^'^ edges remain we must simultaneously control exp(0(l/e)) 
types of random variables, yet our ultimate goal in those sections is simply to estimate all co-degrees 



{Yuv '■ u,v £ Vg}- The following corollary is a special case of Theorem 5.4 

Theorem 2.1. Define ( = C(Pi"-) to be 

( = n-^/'^p-^logn (2.5) 

and for some (arbitrarily large) absolute constant k > let 

r*=min{t:|Q/(inV)-l|>'^C'} • 
Then for every M > 3 w.h.p. 

\Y^,./{np')-l\<:i'''-'C 

for every u,v G Vg and all t such that t < Tq and p{t) > f^-i/2+i/A^_ 
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The heart of this work is in estabhshing the above theorem (Sections^]-[5|, which is complemented 
by the fohowing result: A multiplicative error of 1 + 0(C) for all co-degree variables Yu^v can be 
enhanced to bounds tight up to a factor of 1 + O(C^) on the number of triangles Q. 

Theorem 2.2. Set ^{p,n) = p~'^' ^^^logn and for some fixed a > let 

Ty = min < t : 3u,v such that \Yu^v — np \ > an ' p^> . (2-6) 

Then w.h.p. as long as t < Ty and p{t) > n^-'^'^log n we have 

\Q - nV/6| < a^n^p^^ . (2.7) 

Proof. Let 

3 3 

X{i) = Q-'^ , Z(i) = |X(i)|-aVp$2_ 

D 

We define below a narrow critical interval Iq that has its upper endpoint at the bound we aim to 
establish for \X\. As long as \X\ lies in this interval it is subject to a self-correcting drift, which 
turns out to be just enough to show that E[AZ | Ti] < 0. That is, as long as \X\ G Iq the sequence 
Z{i) is a supermartingale. Standard concentration estimates then imply that \X\ is unlikely to ever 
cross this critical interval, leading to the desired estimate on Q. 
Define 

A = log"^/^ n 

and observe that for all p > n~^''^ log n we have n'^^p~^^^ = 0{l/\/logn), whence as long as (2.7) 
is valid we have Q = (g + 0(A))n^p^. This clearly holds at the beginning of the process, and 
in what follows we may condition on this event as part of showing that the bound (2.7) will be 
maintained except with an error probability that tends to at a super-polynomial rate. 

We begin with estimates on the one-step expected changes of our variables. Recall that due 
to (2.1) we have 

E[AQ|J-,] = - Y: ^"''^^%^^"'""' =2-^E^»%- 

uvwGQ uvGE 

To bound E[A(5 | Ti] we thus need an estimate on "^uveE ^uv achieved by the next simple lemma. 
Lemma 2.3. Let ai, . . . , Om G K and suppose that \a — ai\ < 5 for all i and some a G M. Then 

m ^-^ m 

i 

Proof. The lower bound is due to Cauchy-Schwarz. For the upper bound fix 6 = ^ ^^ Oj and 
note that the assumption on the Oj's implies that |6 — aj| < 25 for all i. Observe that the convex 
function ^^ of achieves its maximum over the convex set {^2,1 ^j = ^^-l ^ ( flj (1^ ~ ^j| ^ 2(^} ) at 
an extremal point where Oj = 6 it 25 for all i. Hence, ai = b — 25 for [m,/2j indices, at = h + 25 for 
[m/2j indices and if m is odd there is a single Oj = h. In particular 

m 

^a^ < lm/2\{b-26f + lm/2\{b + 26)^ + bH{^^^^^A2)}<mb'^ + 4.m6^. ■ 

Since "^uvf^E^^,^ ~ ^Q ^^^ \Yu,v — np^\ < an^''^p^ by Eq. (2.6), an application of this lemma 
together with the fact that \E{i)\ = n'^p/2 — n/2 gives that 

Q 2^ ^«,^. - 1^1 - ^2 ' 
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together with the following upper bound: 



18Q 



Q 



uv£E 



< 



\E\ ' ' Q 
18Q 



+ 



2v?p 



ri' 



P(l-^) (^ + 0(A))n¥ 



18Q 



„ , Oip) + (12 + 0(A))a^$^ = ^ + (12 + 0(A))q^$ 



-a^np^$^ 



2/F>2 



(In the last equality we absorbed the 0(p)-terni into the 0(A) error-term factor of the last expression 
since A$^ — )• oo.) Adding this to our estimate for ¥,[AQ \ J^i] (where again the additive 2 may be 
absorbed in our error term) while observing that A(— gn^^) = 3np'^ + 0{p/n) yields 



E[AX I F^] 



18 



l»,3^3^ 



Q + i^'p 



2/F.2 



< (12 + 0(A))a^$^ + 0{p/n) = (12 + 0(A))a^$ 



again incorporating the 0{p/n) term into the 0(A) error. Thus, by the definition of X 

1 8 

E [AIATI I TA < ^\X\ + (12 + 0(A))a2$2 _ 

Now assume that io is the first round where |X| raises above cP"n?'p^'^, i.e., it enters the interval 
Iq = [a'^n'^p^'^ , a^n^p^'^) where q = (l — log^^ n) 



a . 



Further let 



r = min{i > iq : |X(i)| < a n p^ } . 



Since A (— a^n^p$^) = (6 + 0(^-))(l — j^^-^)a^<^^ and the error-term 0(^-) can easily be in- 
creased to 0(1/ A) as p > 1/n (with much room to spare), the upper bound on A|X| gives 



E [AZ I J-, , r > i] < [ - 3(d/a)2 + 2 + (1 - j^) + 0(A)] 6a^<^^ < 



6-o(l )„,2^2 

logn 



a^^\ 



It follows that {Z{j A t))j>jo is indeed a supermartingale for large n. 

Next consider the one-step variation of Z. Denoting the selected triangle in a given round by 
uvw, the change in Q following this round is at most Yu^v + Yu,w + yv,w and in light of our co-degree 
estimate (2.6) this expression deviates from its expected value of 3np^ by at most 3an^'¥$. In 
particular, \AZ\ = 0(-^/npologn) and letting po = p{io) = 1 — Qi^/'n? this ensures that 

Z{iQ) < (a^ — a^)n¥o'^^ + 0(-v/npologn) < — ^a^n^ologn, 

where the last inequality holds for large n. With at most n^po steps remaining until the process 
terminates, Hoeffding's inequality establishes that for some fixed c > 0, 

™/ r / N ^s / (n^pnlogn)^ 

P {yjj>i,{Z{j A r > 0} < exp -c / /° \ ' 

V n^PQ{y/npQ\ogny 

Altogether, w.h.p. |X(z)| < a^n^pf^"^ for all i, as required. 



-cn/po ^ ^-cn 



Observe that for all p > n ^'^ the function <I> defined in Theorem 2.2 satisfies 1 < <l>/logn < e, 
thus the estimates (|2.6l)-(2.7) precisely feature a relative error of order n~^''^p~^ logn in line with 



the definition of C in Theorem 2.1 Specifically, the conclusion of Theorem 2.1 pro vides the estimate 
\Yu,v — np'^l < an^''^p^ for a = 3^ , thus fulfilling the hypothesis of Theorem 



2.2 



en route to a 



bound of |(5/(gra^p^) — 1| < OC^ for C = (ae)^. Altogether, we can apply both theorems in tandem 
as long as p > 7^-i/2+i/M_ jj^ particular, at that point (2.7) guarantees that the process is still 
active with (g -|- o(l))n'^"^^' triangles, yet there are merely (^ — o{l))n^''^~^^' edges left. This 
establishes the upper bound in our main result modulo Theorem |2.1[ 
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3. From degrees to extension variables 

Our goal in this section is to show that strong control on degrees, co-degrees and the number of 
triangles implies some level of control on arbitrary subgraph counts. Let Yu denote the degree of 
the vertex u and as before let Yu^v and Q denote the co-degree of u, v and total number of triangles 
respectively. Our precise assumption will be that for some absolute constant C > 

\yu/{np)-l\ <CC forallueVb 

\yu,v/{np'^)-l\ <CC for all u, t> e Fg (3.1) 

|Q/(inV)-l| <Ce 

where <^ = C,{n,p) = n^^''^p^'^\ogn. 

The function C, begins the process aX C, = n~^''^~^°^^> and then gradually increases with p until 
reaching a value of n~'^~^°^^> for some (arbitrarily small) e > at the final stage of our analysis. 
However, in order to emphasize that the value of this £ plays no role in this section we will only 
make use of the fact that p{t) is such that 

log(l/C)/loglogn^oo. (3.2) 

Setting 



min{t > : Eqs. (3.1) or (3.2) are violated} , (3.3) 



we will show that as long as t < r^ we can w.h.p. asymptotically estimate the number of copies of 
any balanced (a precise definition follows) labeled graph with 0(1) vertices rooted at a prescribed 
subset of vertices. These estimates will feature a multiplicative error-term of 1 + 0(C ), where (5 > 
will only depend on the uniform bound on the sizes of the graphs under consideration. 

Definition 3.1. [extensions and subextensions] An extension graph is defined to be a graph 
H = {Vh,Eh) paired with a subset Ih Q Vh of distinguished vertices which form an independent 
set in H . The scaling of an extension graph is defined to be 

SH = SH{n,p) = n''"-'»p'", 

where en = {EhI, vh = {Vh] o,nd lh = \Ih\- The density of H is defined to be 

rriH = en/ivH - i^h) ■ 

A subextension K of H , denoted K d H , is an extension graph K = {Vk,Ek) with Vk C Vh, 
Ek C Eh, and the same distinguished set Ik = Ih (thus Vk 3 Ih)- We say that K is a proper 
subextension, denoted by K C^ H , whenever Ek C Eh- We will denote the trivial subextension 
K = (Ih,^) (the edgeless graph on Ih) by 1. For any subextension K G H , define the quotient 
H/K to be the extension graph H/K = (Vh,Eh\Ek) with the distinguished vertex set Ih/k = Vk- 
Observe that 

Sh = Sk Sh/k - 

Let H = {Vh,Eh) be an extension graph associated with the distinguished subset Ih C Vh as 



in Definition 3.1 and set en = \Eh\, vh = \Vh\ and lh = \Ih\- Let (f : Ih ^ Vg he an injection. 
We are interesting in tracking the number of copies of H in G = G{i) with filn) playing the role 
of the distinguished vertices. That is, we are interested in counting the extensions from ^{Ih) to 
copies of H. Formally, an injective map ip : Vh — )• Vq extends y? to a copy of ii^ if V' is a graph 
homomorphism that agrees with ip, that is, ip \ih= ^ and uv £ Eh implies that ip{u)ip{v) S E. 
Define 

^H,(^ = # w £ Vh — )• Vg : "0 extends (p to a copy of H> 

Note that ^H,ip counts labeled copies of H rooted at the given vertex subset (p{Ih)- 

We will track the variable ^H,<fi for extension graphs H that are dense relative to their subex- 
tensions. More precisely, we say that H is balanced if its density satisfies mn > rriK for any 
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subextension K of H. Furthermore, H is strictly balanced if its density is strictly larger than 
the densities of all of its subextensions. Our aim is to control the variable ^H,ifi for any balanced 
extension graph H up to the point where the scaling of H becomes constant, formulated as follows. 
Let 

tH = min{i : SH{n,p{t)) < 1} , (3.4) 

where the times t being considered here are, as usual, of the form i/n^ for integer i. Recalling that 
TTT'H = eu/ivH — i-h) observe that pitu) = n~^''^" , thus 

rriHi > mH2 <^ t^i < i^a (3.5) 

and in particular any balanced H has tn < tx for any subextension K of H {& strict inequality 
holds when H is strictly balanced). Thus, the threshold up to which we can track ^H,ip will indeed 
be dictated by tff rather than by tx for some K <Z H. 

Finer precision in tracking the variables ^H,(p will be achievable up to a point slightly earlier than 
tn such that Sh is still reasonably large. To this end it will be useful to generalize the quantity tn 



as follows: With (2.5) in mind, for any r > let 

t^(r) = min {t : 5j,(n,p(t)) < 1/^} (3.6) 

(again selecting among all time-points of the form t = ijr? for integer values of i). We will establish 
these more precise bounds on ^H,if> up to t~^{S) for some suitably small (5 > 0, i.e., up to a point 
where the scaling Su is still super-logarithmic in n. (Recall that C(i) is decreasing in t and yet 
log(l/C) tends to oo faster than log log n.) 

When examining the times tu and t~^ one ought to keep in mind that, for any extension graph 
U on 0(1) vertices, the value of Sh decreases within a single step of the process by a multiplicative 
factor of 1 + 0{rr'^). In particular, in that case (1 — 0(n~^))^~'' < Sh < ('"^ at time tjj{r). 

The main result of this section is the following estimate on the variables ^h ip, where the pro- 
portional error will be in terms of (", the varying proportional error we have for the variables Yu,v 
and Q. 

Theorem 3.2. For every L > there is some < 5 < 1 so that w.h.p. for all t < t^ and every 
extension graph H on vh < L vertices and injection ip : Ih ^ Vq we have: 

(i) If H is balanced then \^h,lp — Sh\ < ShC^ for all t < t^{^/6). 

(a) If H is strictly balanced then \^H,ip - Sh\ < (cSH)^"^(logn)'' with p = Se^^-'f^ holds for all 
tJiiVS) <t<tH. 



Before proving Theorem 3.2 we will derive the following corollary which will be applicable to 



extension graphs that are not necessarily balanced. 

Corollary 3.3. Fix L > and let < t < t^,. Then w.h.p. for every extension graph H on vh < L 
vertices with p = pn = 3exp{vH — tn) we have: 

(1) IfSxit) > 1 for all subextensions K C H then '^H,ip{t) < (1 + o{l))SHilogn)P. 

(2) If Suixit) < 1 for all subextensions K C H then ^H,ip{t) < (1 + o(l))(logn)''. 



Proof. Assume w.h.p. that the estimates in Theorem 3.2 hold simultaneously for all extension graphs 
on at most L vertices throughout time t. We may further assume that every vertex v £ Vh \ Ih 
has deg(t') > 1. Indeed, for the statement in Part (II| this is w.l.o.g. since adding any isolated 
vertex to Vh \ Ih would contribute a factor of (1 — o(l))n to ^//,<^, balanced by a factor of n to 
Sh- For Part (pi) this assumption is implied by the hypothesis Sn/Kit) < 1, as we can take some 
subextension K with Vk = Vh \ {v} to obtain that {p{t))'^^^^'"' < 1/n. 

Define a sequence of strictly increasing sets Ih = ^o £ ^i $ ^2 ^ • ■ ■ $ X^ = Vh as follows. 
Associate each candidate for Xj for i > 1, one that contains Xi-i, with the extension graph 
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Ki = {Xi,Ei) where 

Ei = {(u, v) G Eh : u,v £ Xi and either u ^ Xj_i or v ^ Xi^i } , 

i.e., the set of edges of the induced subgraph on Xi that do not have both endpoints in Xj_.i, and 
set the distinguished vertex set of Ki to be Ixi = Xi^i (guaranteed to be an independent set by 
our definition of Ei). With this definition in mind, let Xi be the subset of vertices that maximizes 
niKi ■ If more than one such subset exists, let Xi be one of these which has minimal cardinality. 

To see that the sequence of Xj's is strictly increasing, let i > 1 and consider the potential values 
for Xi building upon some Xj_i C Vh- Taking Xi = Xi-i would yield rriKi ~ 0' strictly smaller 
than the density that would correspond to Xi = H (otherwise Vh \ ^i-i would consist of isolated 
vertices in H, contradicting our minimal degree assumption). Hence, indeed Xj_i C Xi and overall 
Ki is strictly balanced by construction. (We note in passing that Kk = H/Kk-i and that Ki is 
only a subextension of H for i <\ since Ih £ Iki for i > 1.) 

Motivating these definitions is the fact that for a given sequence Xq, . . . ,Xk as above one can 
recover ip £ Vh — ^ Vg (counted by ^h,<^) iteratively from a sequence of V'i £ Vxi — >• Vq (in which 
ipo = f and (V'j)|xi_i = V'i-i for ah i), hence 

k 

(Here and throughout this section, with a slight abuse of notation we use the notation tpi G ^Xi to 
denote an injective map -01 counted by the corresponding variable ^Ki-) 

To prove Part ([l]) of the corollary, observe that Ki C H and so by hypothesis we have 5^1 (t) > I, 
or equivalently, t < tx^. For any 2 <i < k, the fact that Xj_i C Xi implies that mKi_i is as large 
as rriK', where K' is the extension graph with vertex set Xi, distinguished vertices Ik' = Xi-2 
and all edges of H between vertices of Xi excluding those whose endpoints both lie in Xi-2- Since 
Ek' = Ei-i U Ei we get 



\Ei-i\ \Ei-i\ + \Ei 



which readily implies that 



-^j-l| ~ I^J-2| \^i\ ~ \^i 



l-E/J-ll . \Ei 



mx,_i = -T^ , rr^ r > ^q ^; r = mK, 



Equivalently (via (3.5)), t^i < ^i^^+i and by induction we thus have t < t^. for all i. Recalling that 



Ki is strictly balanced, we can appeal to Theorem |3 . 2 1 and derive that for any injection rj : Ik^ — >• Vq 
^K„„(t) < (l + max{C^ (5xJ^^(logn)^'})5A. < (1 + o(l))5/<,(logn)^% 



\k 



where pi = 3exp(|Xj| — |Xj_i|). It is easily seen that by definition Sh = Y\i=iSKi and so revisit- 
ing (3.7) yields that under the hypothesis of Part ([I]) we have 



-^hA^) < (l + o(l))5iy(logn)Stift. 

The proof of Part (II| is completed by the fact that exp(|Xj| — |Xj_i|) > 2 for all i (since the Xj's 
are strictly increasing) and in particular 

^p,, = 3j]el^'l-l^-il<3nel'''l-l'''-^l = 3e^«-'«=p. (3.8) 

i i i 

It remains to prove Part ([2]). Observe that the hypothesis in this part implies that Snit) < 1 
(by taking K = 1), that is we are considering some time point t > tn- 
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To utilize the assumption on the scahng of quotients of H, we go back to the definition of the 
sequence of Xj's and argue that 

iTT-Ki — "^-f//_ft:,_i for aU 2 > 1 . (3-9) 

Indeed, this follows from the fact that selecting Xi = H would exactly correspond to having 
Ki = H/Ki-i, and yet Xi is chosen so as to maximize the value of rriXi- 

While -fCj-i itself may not be a subextension of H (one has Ki-i <Z H iS i < 1), one can modify 
its distinguished vertex set to remedy this fact. Namely, the extension graph K' = (Xj_i,£'j_i) 
with the distinguished vertex set Ik' = Ih is a subextension of H, and since Xq C Xj_i we get 
that 

where the last inequality follows from our hypothesis on all the quotients of H. 

From the inequality Sj{/k^_^ < 1 established above we infer that t > tjj/j^._^. At the same time, 
the combination of (3.5) and (3.9) implies that tff/Ki-i ^ ^Ki, and altogether t > txi- Now, since 
^ Ki,ip{t) is monotone non-increasing in t and we only aim to bound this quantity from above, it 
suffices to provide an upper bound on ^Ki,Lp{tKi)- To this end, apply Part ([n]) of Theorem |3.2 



(bearing in mind that Sxi = 1 + 0{n ^) at time t^i and that Ki is strictly balanced) to get that 
for any injection r/ : Ixi — ^ Vq 

^x„r,(ii^J < (1 + 0{n-^)) (1 + o(l)) (1 + (logn)^O , where pi = 3el^>l-l^-il 



At this point we again appeal to (3.7) to infer that 



^H,^(t) < (1 + o(l))(logn)Stift < (1 + o(l))(logi 



where we used the fact that ^^ Pi < p as seen in (3.8). This concludes the proof. ■ 

Proof of Theorem 13. 2[ Let Al denote the set of all balanced extension graphs on at most L 
vertices and at least one edge (in the edgeless case one has '^H,ip = <Sh trivially by definition). 
Since Al is finite and uih > for all H G Al we can choose a sufficiently small constant < 5 < 1 
as follows: 



mm 



1 1 



rriA^mB j { {3eAf 



Combining the fact that {m^ — m^ \ > \/d whenever niA 7^ tub together with the identity 
p{tA) = n~-^' "^-^ implies that for any A, B G Al with tA < ts 

SbHa) = (n'^~B\{tA)X^ = n("^s'-"^l')es > ^VScB > Q-^rs ^ 
where the last inequality was due to e^ > 1 (with room to spare). In particular, 

tA<tB =^ tA< t]^{V5) for aU A, B £ Al ■ 



Contrary to the monotonicity of t^ in i^^A as per (3.5), it is generally not true that t^{r) is monotone 
in niA- However, the above definition of 6 will enable us to still relate these thresholds as we will 
later argue that if itia > itlb then t^{^/S) < t^((3 — o{l))6). 

Proof of Part (m). Let H and tp be fixed. We will apply the previously used martingale arguments 
based on critical intervals. We begin with the expected change in ^H,ip- Here the probability 
of deleting a given copy of H corresponds as usual to the number of triangles resting on each of 
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its edges, and summing these separately can only incur a double count of 0(1) since vh = 0{1). 
Therefore, 

= -(6 + 0(0)%^. 
Now set X = ^ H,ip — Sh- Our critical interval for \X\ will be 

To evaluate the one-step change in X, we first see that the one-step change in Sh is deterministically 
given by 

A5h = n^«-'« {{p-6/nY" -/^) = -n^^-^^^ (^ _ o(n-2))^^^' 

= _ (1 + 0{n-^p-')) ^Sh = - (6 + 0(C)) ^Sh . (3.11) 

When |X| G Iq, we therefore have 

E[AX] = - {6 + 0(0) ^{Sh+X) + {6 + o{O^Sh 

= -(6 + o{l))^X + O{0^Sh = -(6 + o(l))^X, 
n^p n'^p n^p 

where the last inequality used the fact that ShC = o(X) whenever \X\ G Iij, as then we have 
|X| X ShC^ for (5 < 1. Next we define what would be a supermartingale when \X\ G /^, namely 

Z=\X\-ShQ^ . 



As in (3.11), the one-step change in ShC deterministically satisfies 

AShC' = - (1 + 0{n-^p-')) ^M^ShQ' = -(6 + o{l)f-^SHC' , (3.12) 

hence whenever \X\ £ lii, we have 

E[AZ] = -(6 + o{l))^\X\ + (6 + o(l))(eH - (^)^ 

^Zp i^Zp 

with the last inequality valid for sufficiently large n. Thus, Z{t A Ti^) is indeed a supermartingale 
when Tij) is the stopping time for having \X\ exit the interval /\ji. 

We turn to establish concentration for Z(t A r^). We will invoke the following inequality due to 
Freedman [9], which was originally stated for martingales yet its proof extends essentially unmod- 
ified to supermartingales. 

Theorem 3.4 (|9|, Thm 1.6). Let {Sq, Si, . . .) be a supermartingale w.r.t. a filter {J-i). Suppose 
that Si+i — Si < B for all i, and write Vt = X^jZq ^ [(•^'i+i ~ •S'i)^ I ^i\ ■ Then for any s,v > 

^{{St >So + s,Vt< v} for some t) < exp ( - "i „ , j • 

' [ 2[v + Bsj J 
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To qualify an application of this theorem, we must first establish L°° and L^ bounds on AZ, 
which will follow from individually examining the various edges of H whose potential removal at 
time t < tJj{V6) may decrement the value of ^H,ip- 

For each edge ab £ Eh we let T = Tab denote the subextension of H that has the smallest 
scaling among subextensions that contain ab (and a minimal number of vertices if there is more 
than one subgraph with the minimum scaling). Notice that tn < ^t since T is a subextension of 
H. Furthermore, if K is any subextension of T then, being also a subextension of H we again have 
tii ^ tx- In particular, t < tx for all i^ C T and we can invoke Corollary 3.3 (Part M) to give 



^T,^ < {I + o{l))ST{logny^ , 

where px = 3ex.p{vT — ^t)- 

It is worthwhile explaining why invoking Corollary 3.3 at this point is justified. Indeed, one 
can prove the statements of both parts of the present theorem (and in effect also its corollary) by 
gradually extending the time range where they all hold. Formally, we consider the stopping time 
f at which one of the assertions of the theorem fails (for any extension graph) . As long as f > t 
we may assume the bounds of this theorem (and hence also those of its corollary) as part of our 
attempt to increase their validity to the next time step, and so on. The bound we will present for 
Z{t A Tij,) will remain valid in conjunction with the additional stopping time f, and similarly for 
the second part of the theorem (featuring an analogous proof). Finally, analyzing Z{t A r^ A f) 
and its Part ([ii| counterpart is sufficient for proving the theorem since, by definition, at least one 
of these supermartingales (for the various extension graphs) will need to violate its corresponding 
large deviation bound in order for f to occur. 

Consider the effect of removing a given edge xy G Eq (as part of some triangle xyz) on ^ H,ip- 
Copies of H are lost iff xy corresponds to some edge ab G Eh in at least one such copy, hence we 
can bound this amount by adjusting Ih to include a, 6, which would be mapped to x, y. Namely, 
let H* = H*f^ be the extension graph with Ih* = Ih ^ {o, b} whose edges are Eh minus any edges 
between vertices of Ih*- Further let (p* be the corresponding injection extending ip to include x,y 
as the images of a, b (if this is inconsistent with ^p then this copy remains intact) to yield 

abeEff 

Recalling the above definition of T = Tat, let T^^ similarly include a,b in its distinguished set of 
vertices. We then get that 

\^^hJ<Y1 E ^^.VtV.= E E "^h/t,,, (3.13) 

abeEn rie^T*, u>* abeEn TyS-I-r* ^* 

ab ^ ab ^ 

where the equality is simply due to the fact that Vr = Vr* , thus Ha^/'^ab ~ ^ab/'^^ ^^^ moreover 
a,b £ Vt and so H'^b/T = H/T. Consider this last term ^ H/T,r]- We claim that 

*H/r,„ < (1 + o{l))SH/T{lognY«/T . (3.14) 

Indeed, by the choice of T we have Sh > St (as H is itself could play the role of T), and so 
Sh/t — Sh/St > 1- Moreover, if K is any subextension of H with ab G Ek and Vr C Vk then 
again Sj^/j' > 1. Yet by definition, if ab ^ Ek then K/T = K/T where K includes the extra edge 
ab (as both of its endpoints lie in T) and altogether every K C H with Vr C Vk has S^/t ^ 1- 
Noticing that every subextension of H/T can be written as K/T for such K now qualifies an 
application of Part (IT]) of Corollary |3.3| from which the desired inequality (3.14) follows. 

To conclude the -L°°-bound on AZ we need to estimate ^t* ,</3* • We will argue that this quantity 
is at most log ' ' n. Let U* be a subextension of T*, and let U be the modification of U* into a 
subextension of T via setting its distinguished vertex set to It and adding any edges among these 
vertices in T (in particular the edge ab). By construction, Su* /Sjj = St* /St- In addition, since U 
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contains the edge ab then by the choice of T we must have Sjj > St- Combining the two we get 
that 5(7* > St* and in particular St* m* ^ !> i-6-, ah the quotients of T* have scahng at most 1. 
Part pi) of Corollary 3.3 now implies that 



^T*„v* < (l + o(l))(logn)''^* < (l + o(l))(logn)''^ 
Combining this with (3.13) and ( |3.14[ ) we get that 

\^^tJ = O {SH/T{^ogn)P^/^+''^) , 



while recalling the estimates for ASh and AShC i^i (|3.11),(3.12[) we see that they are negligible 



in comparison (note that St < iTpp since a valid candidate for T is the graph containing the single 
edge ab) now extends this to the following L°° bound on the one-step change in Z. 

\AZ\ = O (5H/T(logn)''^/^+^^) . 

For an L^ bound on the one-step change AZ, again consider the effect of removing an edge 
corresponding to some ab £ Eh- For a given ip £ ^T,ip, the probability that the next drawn triangle 
will contain this edge is i^^(a)i/)(fe)/Q = (6-|-o(l))n~^p~^ due to our estimates on co-degrees and the 
total number of triangles present. In particular, the probability to remove an edge corresponding 
to ab G Eh in any copy specified by some ijj G ^t,(/3 is at most 

^T,<^0(n^V^) = O {St (log ny^n^^p-^) . 

As shown above, this can only modify Z by an additive term of O{Sh/t0-OE^)^"^^^^^)- Thus, if 
we enter the critical interval /ij, at some time to = t(^o) then for any i > iq, 

E[(AZ)2 I J-,] < Y, o{ST{lognr^n-^p-'{SH/Tiiognr^/^-^P^f) 

abeEn 

= O {SHSH/Tn-^p-\lognfP«/^+^P^) . 

The quantities Sh and Sh/t above are w.r.t. time t{i), yet our scaling is monotone decreasing in i 
and so we can replace these by the corresponding scaling terms at to- Summing the result over at 
most 0{n?p) remaining possible steps gives 

^E[(AZ)2 \Ti]<0 (5H(to)5H/T(io)(logn)2^«/-+3PT) ^ 

i>io 

where Snito) is short for SH{'n,p{tQ)). Since e^ > one can bound Yli>io'^Hit)SH/Ti't)P~^it) via 
integration to obtain that it is 0{'n?SH{to)SH/T{'to)), leading to the above inequality. 

Compare this last bound for the summation ^jE[(AZ)^ | Ti] with the product of the L°° 
bound B = O [SH/Ti^ogn)''^/^^^'^) and a deviation of s = t^^ShC^- The upper bound on 
^^E[(AZ)^ I Ti] has order at least BsC,^ [log n)^"^'^'^ ^^, clearly dominating Bs- Turning to 
Freedman's inequality and once again applying the fact that Sh = StSh/t we get that for some 
fixed c > 0, 

F(z{t A T^) > Z(to) + s for some t > to) < exp (-cSTitoK^^logny^PH/r-SpA 

We claim that 5T(to) > (1 — o(l))C^ , and in fact this holds for any to < t < t]j{V6). If rriT < mn 
then t < t^(\M) since t < tn < tT and given our definition of S. Moreover, VS < | (as implied by 
Eq. ( 3.10[ ) and the fact that e^ > 1) thus in this case 

5r > (1 - o(l))r^ > (1 - oim-'' . (3.15) 

On the other hand, when rriT = mn we can write 

St = {n^/"'^py^ = {n^/"'"py^ = {ShT^^'" > 5^/"^ > (1 - o(l))C"^/"" , 
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with the final inequahty due to t < tjj{\/d). By (3.10) we have y6 < l/(3e//), hence (3.15) is again 
vahd. Altogether we can conclude that 

F(z{t A r^) > Z{to) + s for some t > to) < exp (-{c - o{l))C-^{logn)-'^P"/T-^pA 

< exp (— log n) , 

where the last inequality is thanks to ( |3.2[ ). Eq. ( |3.15| ) similarly implies that 

SH/AlognfW = g(logn)«(i) < ShC''-"'''^ , 

that is, the L°°-bound on Z, which is valid regardless of whether \X\ E Iij,, satisfies B = o{ShC^)- 
Accounting for the first step in which \X\ enters the interval /iji we get 

Z{to) + s < -^ShC' + B + ^ShC' < 

for sufficiently large n. In particular, \X\ < Sh(^ for all t > to except with probability exp(— log^ n). 
This probability allows us to absorb a union bound over all n*^^^^ choices for {H,ip) and to, thus 
concluding the proof of Part (m) of the theorem. 

Proof of Part (III]). Let H and ip be fixed, and let X = '^H,ip — -Sh- We follow the framework of 
applying a martingale argument within an appropriate critical interval for \X\, which would now 
be 

Ivt = [(1 - f )(5H)^-^(logn)''« , (5H)^-^(logn)''«] , 

where pu = 3exp(f// — i^h)- From Part (m) of the theorem we get that \X\ < ShC ^t time 

t = tJj{VS). Yet at this time Sh < C^ by definition and so ^^C"^ < 1. We thus infer that 

l-'^l < 'ShC ^ 'S}f 1 and in particular \X\ ^ Iq, at time t^{v5) provided that n is large enough. 
As in the proof of the first part of the theorem. 



and so 



E [A*,^,^] = -(6 + 0{C))^^H,^ , AcSh = -(6 + o{C))^Sh 



K[AX] = -(6 + o{l))^X + O{0^Sh = -(6 + o{l))^X , 



where the last equality will be justified by showing that ShC = o(|X|) when \X\ G Iq,. Indeed, with 
< (5 < 1 a sufficient condition for ShC = o{{SH)^'^{lognyH) is to have Sh = o (C^Hlogn)^^), 
which in turn holds since t > tj^(vo) and thanks to the assumption in (3.2) on the decay rate of C- 
Similarly to ( [3lT| ) we have A(5h)1-^ = -(6 + o(C))(l - \/5)^(5h)^"'^ deterministically, 
hence the random variable defined by 

Z = |X|-(5^)i-^(lognr« 

satisfies the following whenever \X\ G /^ and for any sufficiently large n: 

E[AZ] = -(6 + o(l))^|X| + (6 + o(l))(l - V6)^{SH)^-'^\logny" < . 

Therefore, if to = t(io) is the first time where |X| > (1 — ^)(5//)^^^'^(logn)^^ and t^jj = min{t > 
to : [-'^(t)! ^ /"I-} then Z{t A Tip) is a supermartingale. 

For L°° and L^ bounds on AZ we again wish to analyze the effect of removing some edge xy G Eg 
playing the role of some ab £ Eh in a copy of H. Letting T = Tab as before be the subextension 
of H that has the smallest scaling among subextensions that have ab £ Et, we now claim that in 
the setting of this part of the theorem we in fact have T = H. Indeed, since H is strictly balanced. 
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any K C^ H must have tff < tx and thus also tu < t^{\/o). The hypothesis tJj{\/5) < t < tn then 

imphes that Sh < (~^^ and yet Sk > C~ > precluding the choice Tab = K. 

Revisiting the argument used to prove the previous part of the theorem, we let i/*^ denote the 
extension graph that adds a, b to Ih and let ip* denote a corresponding injection. We then have 

and yet, the by the minimality of Sh* we must have Sjj* jk* ^ 1 foi' any subextension K* C //*^ 
and so Part Q of Corollary 3.3 implies that 

*H*„v* < (1 + 0(1)) (log 71)"^*. 

Combining these two equations yields that the potential change in '^H,ip is at most 

\Ph*, 

ab 



|A^//,(p| = 0( max (logn 



ab&Ejj 



Similarly to (3.11), the quantities Sh and {Sh) (logn)''^ deterministically change by at most 
0{SHn-^p-^{\ogn)PH) = O {SHQ^i^ognf+P") = o(l) 

due to the fact that Sh < Q'^^^ {oi 5 <\ and the decay rate of Q. Altogether we get 

|AZ| = o(max{(logn)^^ai. : ah G Eh]) = O ((logn)^^/'^)^^) , (3.16) 



where we used the fact that lh* > i^h since at least one of a, b is in Ih* \ Ih (as otherwise ab 4 Eh 

ab ^ ^ ~ 

as Ih is an independent set), and so pn = 3e'""~^" > epn*- Note that this L°°-bound holds in 
general regardless of whether \X\ G l^. In particular, since \X\ < (1 — ^)(5//)^~^ (logn)^^ one 
step prior to to and can only increase by 0((logn)'^'^^''^) at a given step, we have 

\X{to)\ = (1 - ^)(5H(to))'-^(logn)^^ + O ((logn)(V-)p«) 

= {1-4 + o{l)){SH{to))'-'^\logny« , (3.17) 

(where the term (logn)^^''^'''^ was absorbed in the higher order (log n)''^ -term) and similarly, for 
all to < t < T<^ 

|X(t)|<(l + o(l))(5^(t))i--^(logn)^«. 
As such, since the probability to remove an edge that corresponds to some ab G Eh in a given copy 
tp G "^ H,ip{i) is at most ^^(a)i/'(b)/Q = (6 + o(l))n~^p~^, the overall probability for this event to 
occur for some V' G ^H,<^(t) is at most 

^H,^(i)0(n- V) < O ((cSH(t) + {SH{t))'~'^'{\ogn)P") n'^p-'^ 

This implies that for any t = t{i) for i > ig we have that E[(AZ)^ | J^j] is at most 

Y, o((5^(t) + (5^(t))i-^(logn)''«)n-Vt)-i(logn)2-"''«) , 

ab&En 

and integrating this over the O{n^p{to)) remaining possible steps (again noting that eH{l — VS) > 

and so Yli>ioi'^Hy~ P^^ = 0{n'^{SH{to))^~ ) as in the proof of Part (ji| of this theorem) now 
gives 

i>io ab^Eff 



^E[(AZ)2 \J^,]< Y O [(SH{to) + {SH{to))'-'^'{lognr«) {lognf^''^ 

ab^EH 

O ((SH{to) + {SH{to))'-'^HlognrA {lognf'-'pA . (3.18) 
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Figure 2. The set of triangular ladders ^t^ (labeled by vr) for all |7r| < 4. 



Compare this bound with the product of the L°° bound given in (3.16) and a deviation value 



of s = ^(5H(to))^"^(logn)''«. This product has order (5//(to))^"^'^(logn)(i+^ ')/'■» whereas 



"{^t > to : Z{t A T^) > Z{to) + s] < exp 



our bound in ( 3.18| ) has order at least {Snito))^ ^ (logn)'^^^"^ '^^ , strictly larger. We can thus 
extract from Freedman's inequality that for some fixed c > 0, 

. (5j,(to))^-^v^(logn)(^-^'--^)PH \ 

^Snito) + (5^(to))i-^(logn)P^y 

which, according to the larger term between {Sh , {•Sh)^ {^ogn)^'^}, is at most 

exp (-(c/2)min{(5H)^-^^(logn)(2-2e-^)Pi. ^ (5j/)i-^(logn)(i-2e-i)p«|^ 

Going back to the definition pn = 3ex.p{vH — i^h), since vh — i-h ^ ^ (otherwise H is edgeless) we 
have ph > 3e and thus the exponent of the logn term above is at least 3(e — 2) > 2 and we can 
deduce that for any sufficiently large n we have 

3t>to: Z{t A Tq,) > Z{to) + s) < exp(- log^ n) . 



Finally, recall that Z(to) 



-v^ 



o(l))(5iy(to))^"'^(logn)'''ff as guaranteed by (|3.17|). We can 



thus conclude that Z{t) < 0, that is \X{t)\ < (cSH(t))^"'^(logn)^», for all t > to and large enough 
n except with probability exp(— log n). A union bound over the n*^'^) choices for to and the pairs 
{H, ip) completes the proof of the theorem. ■ 



4. Triangular ladders and their extensions 

Definition 4.1. [triangular ladders] Let 11 C {e,0, 1}* denote the set of all words which do not 
have in indices 1 and 2, have at most 2 occurrences of e and do not contain the substrings eO, 
elO or ele. For any vr E 11 o/ length k > 1, the triangular ladder J^-,^ is the following labeled 
undirected graph on the vertex set {0, 1, 2, . . . , /c + 1}; 

(i) k = 1: J£\ has the two edges {02, 12}; J£^ has the single edge 12. 

(a) k > 2: let H = J£^~ where vr" is the (|7r| — l)-prefix o/vr. Let ^^^ be the graph obtained from 
H as follows: First add the vertex k + 1 and the edge {k,k + 1). In addition, if 7r{k) = 1 
add the edge {k — l,k + 1), whereas if TT{k) = add the edge (v, k + 1), where v is the unique 
neighbor of k in H other than k — 1. 
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Figure 3. Fans in triangular ladders: The vertices highlighted in red form 4- fans at vertices 
2, 4 and 5 (note the exceptional structure of the fan at a = 4 due to having 7r(o — 2) = e). 

Observe that vertex is isolated iff 7r(l) = e. The set of triangular ladders of length at most 5 is 
illustrated in Fig [2] Implicit in the above definition is the fact that by construction, the last vertex 
of Jfyr, i.e., the vertex A: + 1 when |7r| = k, has precisely two neighbors if TT{k) G {0, 1} (one of which 
is the vertex k) and exactly one neighbor (the vertex k) when 7r(A;) = e. In particular, since the 
substring eO cannot appear in vr G 11, the case 7r{k) = in Item ml) above has 7r(A; — 1) 7^ e and so 
the vertex k indeed has two neighbors in H. Further note that if 7r(£) = e for i > 1 then the vertex 
£ is a cut-vertex of ^,r- Namely, in this case there is no edge (a, b) G i?(^7r) such that a < i < b. 

Definition 4.2. The scaling of a triangular ladder ^^^ w.r.t. some given n and p is defined to be 
S^ = S^{n,p) = 72"--^; where e^ = \E{^^)\ and v^ = \V{^^)\. 

The scaling S^ will correspond asymptotically to the expected number of labeled copies of ^t^ 
rooted at two given vertices (matching the labels and 1) along our process. Recalling the notion 
of extension graphs (see Definition |3.1| ), 3-,^ is simply the scaling of the extension graph obtained 
from ^T^ via the distinguishing vertex set I = {0, 1}. Observe that if Ivrl = k and vr contains 
< s < 2 copies of e then 5^ = nf'p^^'^. 

Our focus will be on tracking a special subset of the family of all triangular ladders, whose 
formulation will require the following definition. 

Definition 4.3 (/-fan). For / > 3, an /-fan at some vertex a > in a triangular ladder ^-^ is a 
sequence of vertices that has one of the following two forms: 

(1) 7r(a — 2) 7^ e and vertices a + l^a + 2, . . . ,a + f + 1 are all adjacent to the vertex a. 

(2) Tr(a — 2) = e and vertices a — l,a + 1, . . . ,a + f are all adjacent to the vertex a. 

Equivalently, ^t^ has an f-fan at a > with 7r(a — 2) ^ e if 7r(a + 1) = 1 and 7r(a + i) = for 
2 < i < f. Similarly, ^-^ has an f-fan at a > with 7r(a — 2) = e if Tr(a -|- 1) = 1 and 7r(a + i) = 
for2<i< f -I. 

Note that there cannot be an /-fan at 0, nor can there be one at a vertex a whenever 7r(a) = e. 
Figure [3] illustrates the structure of /-fans as defined above. 

Definition 4.4. [M-bounded triangular ladders] For M > 3, define Bm to be the set of a// vr G 11 
that do not contain an M-fan and in addition satisfy one of the following conditions: 

(a) IT contains no copy of e and has length at most 3M — 1, or 

(b) TT contains 1 copy of e, has length at most 2M and does not have tt{2M) = e, or 

(c) vr contains 2 copies of e, has length at most M +1 and does not have vr(M + 1) = e. 
i/vr G Bm we say that ^-j^ is an M-bounded triangular ladder. 

It is easily seen that Bm C Bm' for any M < M' . The following additional observation demon- 
strates some of the reasons for the precise choice of constants in the definition of Bm- 

Observation 4.5. The family Bm is closed under the prefix operation. Indeed, let o" be a proper 
prefix of some vr G Bm- Precluding an M-fan from vr clearly also precludes it from a. While a may 



contain fewer copies of e, these would meet relaxed length criteria in Definition 4.4, hence a G Bm 
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Figure 4. Examples of M-bounded triangular ladders for M = 5. Outer boundary edges 
w.r.t. B5 are highlighted in red, side boundary edges are green and initial edges are brown. 



Having defined the family of graphs we wish to track throughout the process, we proceed to 
a classification of the edges in ^j^ for tt G Bm- This will in turn be used to study the effect of 
removing such edges as part of triangles eliminated by our process. 

Definition 4.6 (boundary edges). Let ^-^ he an M-bounded triangular ladder for some M > 3 
and let yz G E{^t^) he such that < y < z and 7r(y) 7^ e. Let 



vr 



vr^.ioO ify<z-l 
vr^-i o 1 ify = z-l 



where tt£ denotes the i-prefix of vr and o denotes string concatenation. We say yz is a boundary 
edge of J£t^ w.r.t. Bm 'i-ff'^' ^ ^m- ^^ other words, an edge is a boundary edge if the ladder obtained 
by removing all vertices larger than z and then attaching a new vertex adjacent to y, z no longer 
belongs to the family Bm- We classify a boundary edge yz into one of two categories: 

• outer boundary edges; z = |7r| + 1 (last vertex in J^-^) and in addition \'k\ is maximal w.r.t. 
its number of copies of e (i.e., |7r is 3M — 1, 2M or M + 1 if it has 0, 1 or 2 such copies resp.). 

• side boundary edges.- all other boundary edges. Necessarily y < z — 1, there is at most 1 copy 
of e in IT, the ladder ^,r has an (M — l)-fan at y whilst ^£^^1 has an M-fan at y. 

To see that indeed side boundary edges are accompanied by vr of the form specified in the above 
definition {y < z — 1, at most 1 copy of e and an Af-fan in vr') argue as follows. First, we claim that 
if vr has 2 copies of e then ^Sfjr cannot have side boundary edges. To show this, recall that in this 
case I vr I < M + 1 and there are 2 copies of e in vrAf . After excluding the case |vr| = M + 1, z = M + 2 
(corresponding to an outer boundary edge) we are left with either 2: = M + lor2;<M + l. In the 
former case, |vr'| = M + 1 and vr' retains both e copies of vr in its M-prefix, thus vr' G Bm by Item ([c]) 
of Definition 4.4, In the latter, |vr'| < M, and either it contains 2 copies of e — here vr' G Bm as 
per the value of |vr'| — or it contains 1 copy of e and again vr' G Bm since ^7^' does not contain an 
M-fan (any /-fan in vr' would have vr' be at least / + 2 symbols long). We have thus established 
that vr can have at most 1 copy of e. Next, recall that by definition vr has no M-fan. The exclusion 
of the cases |vr| = 2M, z = |vr| + 1 with 1 copy of e and |vr| = 3M— 1, z = |vr| + 1 with no copies of e 
(corresponding to outer boundary) implies that |vr'| < |vr|, and yet vr' ^ Bm (since yz is a boundary 
edge), hence it must contain an M-fan. Comparing vr to vr' now implies that vr' = vr^-i o so as 
to accommodate a new M-fan (missing from vr2„i), therefore indeed y < z — 1. Finally, it further 
follows from this analysis that 



y + M 
y + M 



if vr(y 
if vr(y 



/e, 



(4.1) 



We complement the definition of boundary edges by categorizing the each of the remaining edges 
into one of two types: 
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Definition 4.7 (non-boundary edge classes). Let .i^ be an M-bounded triangular ladder and let 
yz be a non-boundary edge of ^j^. We say that yz is an initial edge if y < z and vr(y) = e, and 
otherwise we say that yz is an interior edge. 

Figure |4] depicts the family of 3-bounded triangular ladders, highlighting, outer boundary edges, 
side boundary edges and initial edges in these ladders w.r.t. B3. 

Remark. While /-fans and M-bounded triangular ladders were both defined for values of /, M > 3, 
one could extend these notions to / = M = 2. However, going back to Definition |4.3[ a 2-fan at 
vertex 3 occurs whenever vr begins with elll, as opposed to /-fans for / > 2 which are accompanied 
by non-overlapping patterns of the form 10 ... 0. In light of this anomaly at M = 2, and given that 
our application would have M be arbitrarily large, our attention will be restricted to /, M > 3. 

Definition 4.8 (backward extension). Let .if^ be a triangular ladder that contains some outer 
boundary edge yz. The backward extension from yz to ^^, denoted by B-,^^yz, is defined as the 
graph obtained by deleting from, ^j^ all edges within the distinguished vertex set I = {0, l,y,z}. 

Note that we have the following scalings for these extension graphs 

'n^M-3p6M-3 if 1^1 = 3M - 1 , 



Sb 



n,yz 



n 



2M-2^4M 



p^^'-^ if|7r| = 2M, (4.2) 

Af-l 2Af-l if Ivrl = M + 1 . 



n p 



As in Definition 3.1 , the density of a backward extension graph H is equal to mn = e_H-/(f// — 4), 



as the distinguished vertex set of if is I = {0,l,y,z} for some outer boundary edge yz. To 



qualify an application of Theorem 3.2 on backward extension graphs we need to verify that they 
are balanced (that is, the density of a backward extension graph H is as large as the density of any 
subextension K of H). 

Lemma 4.9. Let J£t^ be an M-bounded triangular ladder for some M > 3, and let yz be an outer 
boundary edge of .^-^ . Then the backward extension graph B^^^yz is balanced. 



Proof. With Definition 4.6 in mind, without loss of generality we have z = |7r| + 1 where |7r| is 
either 3M — 1, 2M or M + 1 depending on whether it has 0, 1 or 2 occurrences of e, respectively. 
We will first prove the statement of the lemma for the case where '7r(l) = e. 

Suppose for the sake of contradiction that B = B^^^yz = (Vb^Eb) is not balanced, and let 
C = {Vc,Ec) be a maximal proper subextension of B such that mc > ms. Let a,b be vertices 
such that [a, b) is the first interval of vertices missing from Vc, that is, j G Vc for 1 < j < a and for 
j = b whereas j ^ Vc for a < j < b. Observe that 2 < a <b < z = \Vb\ since {1, y, z} C Vc Q Vb- 
By the maximality assumption, the extension graph we get by adding a, . . . , 5 — 1 (and all incident 
edges) to C has density at most niB- Let cq be the number of edges within a, . . . , 6 — 1 or connecting 
these vertices to Vc in B (or equivalently in .if^r, since {0,l,y, z} C Vc and so the two edge-sets 
are equal). Since mc > tub we now get 

j^^<mB. (4.3) 

b — a 

As each vertex that joins a triangular ladder has either one or two neighbors among the previous 
vertices, one of which is always the preceding vertex, we also have 

eo > 2(6 - a) + 1 - # {a < j < 6 - 1 : 7r(j - 1) = e} . (4.4) 

In this observation we included the edge {b—1, b) (the -|-1 term in the right hand side) yet disregarded 
another potential edge joining b to some 7 < 6 — 1 (which is present iff -7r(6 — 1) 7^ e). We call 76 
the 'extra' edge, and a careful consideration of when 76 exists will play a role below. Consider now 
2 cases depending on the structure of tt. 
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(1) vr contains one copy of e: 

Here we have \Vb\ = \V{^-n)\ = 2M + 2 and e^ = 2(|Vs| - 2) - 1 = 4M - 1. We claim 
that ly,lz ^ E[^T^) and thus the extension graph B satisfies Eb = E{^t^) \ {yz}. Indeed, 
as the pattern elO is forbidden from any vr E 11, the neighborhood of the vertex 1 is always a 
subset of {2, 3}. Since the form of vr dictates that z = 2M + 1 and that ^vr has no M-fan, in 
particular y > M + 1 > A and so neither y nor z are neighbors of 1, as claimed. Altogether, 
l^^l =e^-l = 4M-2 and 



ms 



\Eb\ 



2 + 



1 



M-1 

1 is present since h 



\Vb\-^ 
Further note that the extra edge 76 with 7 < 6 
7r(6 — 1) 7^ e. Several cases are now possible. 

Ca se (|1 1): a > 2. Then the symbol e does not appear in 7r(a — 1), . . . ,7ri 
from (|43),(44]),(|45]) that 6 - a > M - 1, and as 3 < a < 6 < 2M + 1 we get 

M<h-a<2M-2. 



(4.5) 



1 > a > 2 and so 



2). It follows 



(4.6) 



By Definition 4.4, the ladder ^.j^ does not contain an M-fan, 

l)-fan at 7). Combining 



Examine the extra edge 76. 

implying that 7 > 6 — M (this inequality is tight if there is an (M 

this with the fact b — a > M yields a < 7 < 6 — 1, hence we can adjust (4.4) by the extra edge 

76 to have eg > 2(6 — a) + 2. By (4.6) we now get 

2 

ruB, (4.7) 



eo 



>2 + 



>2 + 



b-a ~ b-a ~ M-1 
contradicting ( |4.3[ ). 

Case ([l|2): a = 2 and 6 > 3. Note that here 6 < 2M + 1 as otherwise Vc = {0, 1, 2Af + 1} 
contradicting the fact that {0,l,y,z} C Vc- Since '7r(l) = e, we have a contribution of —1 
from the last term in ( |4.4[ ). On the other hand, since 6 > 3, the extra edge 76 must feature 
2 < 7 < 6 — 2 (the vertex 1 has only {2, 3} as neighbors in particular excluding 6 > 4). Taking 
this into account we can modify (4.4) into eo > 2(6 — a) + 1. At this point we can immediately 



preclude the case 6 + 1 € Vc- Indeed, if 6 + 1 € Vc* then this vertex has 2 neighbors among 
previous vertices (as 7r(6) 7^ e), one of which must be among 2, . . . , 6 — 1, thus eo > 2(6 — a) + 2. 
At the same time, having 6 + 1 < \Vb\ = 2M + 1 implies that b — a < 2M — 2 and so 
inequality (4.7) is again valid, contradicting (4.3). Assume therefore that 6+ 1 V^ and let 6' 
be the smallest index greater than 6 such that 6 G Vc- Let ei be the number of edges among 
2, . . . , 6 — 1, 6 + 1, . . . , 6' — 1 plus those joining this set with Vc- In place of (4.3) we now write 

, <mB = 2-\ , 

6' - 3 M - 1 ' 

and accounting for the edge 12, two edges going backwards from each of the vertices 3 < 

j < b' — 1 and one edge going back from 6' we arrive at ei > 26' — 4. This implies that 

ei/(6' — 3) > 2 + 2/(6' — 3) > tub, a contradiction. 

Case (fllS): a = 2 and 6 = 3. In this case, the first interval missing from Vc is simply 

the singleton {2}. Recall that \Vb\ = 2M + 2 > 8 and that vr begins with the prefix ell 

(thus the vertex 2 is adjacent to {1,3,4} to the very least in ^n)- Clearly, if 4 £ Vc then 

eo/(6 — o) > 3 > TUB in contradiction with (4.3). Assume therefore that 4 ^ Vc and let 6' 



be the smallest index greater than 3 such that 6' G Vc- Again let ei be the number of edges 
among the vertex 2, 4, . . . , 6' — 1 plus the number of edges joining these vertices with Vc- Instead 



of (4.3) we have 



ei 



<mB = 2 + 



1 



6'-3 ~ ■'" 'M-1 

Counting the edges 12, 23, two edges going backward from each 4 < j < 6' 
the edge (6' - 1,6') we get ei > 26' - 5 



(4.8) 



1 



as well as 
Therefore, it follows from Ksh that 6' > M + 2. 
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This case will be concluded by examining the extra edge 7'^' such that 7' < 6' — 1 (present 
since 7r(6' — 1) 7^ e). Crucially, .^ does not contain an M-fan at the vertex 3 and so 7' 7^ 3 
(if / is the maximal /-fan at 3 then by definition the neighbors of vertex 3 are precisely 
{2, 4, . . . , 3 + /}, a set precluding b'). Since 6' > M + 3 > 6 we further have 7' 7^ 1 and overall 
the extra edge 'j'b' further increments ei yielding ei > 26' — 4. Since b' < 2M + 1 this gives 
ei/ib' - 3) > 2 + 2/(6' - 3) > 2 + 1/(M - 1), contradicting (|48]). 
(2) TT contains two copies of e: 

Here \Vb\ = \V{^^)\ = M + 3 and e^ = 2(|Vb| - 2) - 2 = 2M. Again we claim that 
ly, Iz ^ E{^T^) and hence Eb = E{^t^) \ {yz}. To see this, observe that when vr begins with 
ee the vertex 1 has only 2 as its neighbor, which has a subset of {1,3,4} as its neighbors due 
to the forbidden pattern elO. Therefore, in this case having z = M + 2>f> implies that y > 2 
and ly ^ -E(.i^). Alternatively, if vr has 2 copies of e and yet 7r(2) 7^ e, then the index j > 1 
such that 7r(j) = e in fact satisfies j > 3 (and hence is not a neighbor of 1) due to the forbidden 



patterns eO and ele, and moreover j < M -\- 1 due to Definition 4.4, The fact that the vertex 
j is a cut-vertex of ^-jt now leads to the conclusion that \Eb\ = Cn — 1 = 2M — 1, thus 



TUB 



\Eb\ 



IVbI 



2 + 



1 



M -1 



Case ([2I1): There are no copies of e in {7r(a — 1) 
7r(l) = e. By (4.4) we have 



, 7r(6 — 2)}. Observe that a > 3 since 



eo/{b 



> 2 + 1/(6 - a) > 2 + 1/(M - 1) 



(4.9) 



(where we used the fact that 6 — a<M — 1 as3<a<6<M + 2), contradicting (4.3). 

Case ([2I2): a > 3 and the symbol e appears (once) in {'/r(a — 1), ... , 7r(6 — 2)}. Here there 
are no later occurrences of e and so the extra edge 76 (with 7 < 6 — 1) is present. Moreover, 
7 > £ where a — 1 < i < 6 — 2 is the aforementioned index of e in vr, since, crucially, vertex £ is a 
cut- vertex of ^t^. It now follows that when i > a the extra edge 76 contributes to the count of 
eo and gives cq > 2(6 — a) + 1, which as shown above (see (4.9)) contradicts (4.3). Conversely, 
if -7r(a — 1) = e then the forward neighbors of a — 1 are confined to the subset {a, a + 1}. 
Consequently, as long as 6 > a + 1 we again get that 7 > a and the extra edge 76 again leads to 
the sought inequality eo > 2(6 — a) + 1 that yields a contradiction to ( |4.3[ ). It remains to deal 
with the case 7r(a — 1) = e and 6 = a + 1, where we claim that b < z. To see this, observe that if 
6 were the last vertex of C (vertex z = M + 2) then its neighbors would be {a — 1, a}. However, 
since y j^ a (by the fact that a ^ Vc whereas y must belong to C as it is a distinguished vertex) 
we would then get y = a — 1 and 7r(y) = e, contradicting the fact that yz is a boundary edge 
(see Definition 4.6). It now follows that |7r| > 6 = a + 1 and since 7r(a — 1) = e we must have 
7r(a) = 7r(a+l) = 1 by definition of H. In particular, the edge (a, a + 2) belongs to i?(^7r) (and 
also to B, recalling that a is not a distinguished vertex). To conclude this case argue as follows. 
Either a + 2 G Vc, in which case cq = 3 due to the neighbors {a — 1, a + 1, a + 2} of the vertex 
a and we have eo/(6 — a) = 3 > 2 + 1/(M — 1) as needed. Otherwise, letting 6' be the smallest 
index larger than 6 = a + 1 such that b' £ Vc (which must exist since z = M + 3 G Vc), we can 
consider the set {a, a + 2, . . . , 6' — 1} and denote by ei the number of edges incident to it in C. 
We must have ei/(6' — a — 1) < tub and yet ei > 2(6' — a — 1) + 1 since it includes two edges 
going backward from each of a + 2, . . . , 6' — 1, the two edges (a — 1, a), (a, a + 1) and finally the 
edge (6' - 1, 6'). This yields ei/(6' - a - 1) > 2 + 1/(6' - a - 1), and as 3 < a < 6' < M + 2 we 
get that ei/(6' — a — 1) > 2 + 1/(M — 1) = tub, contradiction. 

Case ([2|3): a = 2 and 6 > 3. If the second occurrence of e does not appear in {tt{2), . . . , 7r(6— 
1)} then the extra edge 76 contributes to eo (we are assured that 16 ^ E[^t^) since 6 > 3) and 
again we obtain eo > 2(6 — a) + 1 to contradict (4.3). (Note that 6 — a<M — 1 in this case — 
hence the contradiction — due to the assumption that vr features a second copy of e in one of the 
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indices {6, . . . , |7r| — 1}, in fact implying that b — a < M — 2). Assume therefore that the second 
occurrence of e does appear in the sequence. Observe that the assertion b < z = M + 2 stih 
holds since otherwise Vc = {1, z} leaving out the distinguished vertex y. Now, if 6+1 G Vc then 
this vertex has a neighbor in 2, . . . , 6 — 1, and the corresponding edge cancels the contribution of 



the extra copy of e to the right hand side of (4.4). This again gives cq > 2(6 — a) + 1 and leads 



to the same contradiction (recall that 6 < M + 1 and so 6 — a<M — 1). It remains to deal with 
the case 6 + 1 ^ Vc- Again let 6' be the minimal index such that b < b' £ Vc and let ei count 
the number of edges incident to 2, . . . , 6 — 1, 6 + 1, . . . , 6' — 1, we must have ei/(6' — 3) < ms- 
However, ei includes two edges going backward from all but two of these 6' — 3 vertices (those 
corresponding to copies of e that contribute just one backward edge), the extra edge 76 and 
the edges (6, 6-1) and (6' - 1, 6'). Overah, ei > 2(6' - 3) + 1 > 2 + 1/(M - 1) = m^ as before. 

Case ([2I4): a = 2 and 6 = 3. Recall that the last vertex in Vc is z = M + 2 > 5, and 
examine the vertex 4. Regardless of whether vr begins with ell or eell, the vertex 2 has {1, 3, 4} 
as its neighbors in E{^-,^) and so if 4 G Vc we immediately get that eo/(6 — a) = 3 > niB-, 
contradiction. Assume therefore that 4 ^ Vc", let 6' > 4 be the minimal index beyond 4 with 
6' G Vc (bearing in mind that z £ Vc by definition) and let ei be the number of edges incident 
to 2,4, ...,6'-l in Vb. 

If the symbol e does not appear in {7r(4), . . . ,7r(6' — 2)} (it is forbidden from occurring at 
7r(3) as per the definition of 11) then ei > 26' — 5, since the vertices 4, . . . , 6' — 1 together send 
2(6' — 4) edges backwards, accompanied by the edges 12, 23 and (6' — 1,6'). Thus, in this 
case ei/(6' — 3) > 2 + 1/(6' — 3) > niB, as desired. Finally, if the symbol e does appear in 
{7r(4), . . . , 7r(6' — 2)} then the extra edge j'b' (satisfying 7' < 6' — 1) is present. Moreover, if we 
let 4 < J < 6' — 2 be the index such that tt{j) = e then 7' > j since the vertex j is a cut- vertex 
in .ifjr- As such, the edge 7'6' contributes to ei, thus canceling the negative contribution of the 
extra e symbol. That is, again ei > 26' — 5, eliminating this case. 

All cases above led to a contradiction, completing the proof of the lemma for ladders beginning 
with e. It remains to prove the balance property for ladders beginning with 1, which will follow 
from similar arguments yet here the vertex will play a delicate role. 

(a) vr contains no copies of e: Here Vn = 3M + 1 and Ctt = 6M — 2, and therefore |Vb\-/^b| = 3M — 3 
(here we used the fact that y ^ 0,1 — and thus \Ib\ =4 — since z = 3M and forbidding an 
M-fan implies that y>z-M = 2M) and \Eb\=QM -?,. Therefore 

6M-3 1 

^--B = TTITT 7; = 2 + 



3M - 3 M - 1 

As before, if we consider the first interval {a, . . . , 6 — 1} missing from Vc we must have eo/(6 — 
a) < ruB- The lack of copies of e implies that cq > 2(6 — a) + 1 (with the additive 1 accounting 
for the edge (6—1,6) as usual) and so 6 — a>M — 1. This in turn implies that b — a > M 
and so the extra edge 76 lands in the interval [a, 6) (since j > b — M due to the forbidden 
M-fans). As such, we cannot possibly have b = z since both of its neighbors are missing from 
Vc (whereas y must belong to Vc as a distinguished vertex). Therefore, b < z — 1 = 3M — 1 
and, since a > 2 (again recall that 1 is distinguished) we have b — a < 3M — 3. Furthermore, 
this entitles us to consider the vertex 6 + 1. One more edge incident to this interval is needed 
to secure the contradiction: if 6 + 1 G Vc* it immediately provides such an edge, since its two 
neighbors are 7' < 6 and 7' > 7 > a. In this case we get 

en 3 3 

b — a b — a 6M — 3 

as required. On the other hand, if 6 + 1 ^ Vc we can again look at the next missing interval 
{6 + 1, . . . , 6' - 1} where 6' G Vc satisfies 6' < z = 3M (hence 6' - a < 3M - 2). The edges 
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(6 — 1, b),^b, (b' — 1, b') now give the required contradiction since 

ei 3 3 

> 2 + T, 7 > 2 + TTTT ^ = rriB . 



5'-a-l- U -a-l - 3M-3 

(b) vr contains 1 copy of e: We have w^ = 2M + 2 and e^^ = 4M - 1, thus |Vb \ /b| = 2M - 2 

(again y / 0, 1 since z = 2M + 1 and y>z — M = M + 1) and |-Eb| = 4M — 2, and as before 

we obtain 

4M-2 1 

"'^ = 2M^ = ^+M^- 

If e ^ {7r(a— 1), . . . , nib — 2)} then exactly as before we have cq > 2(6 — a) + 1, implying that 
b — a > M and so, if present, the extra edge 76 has an endpoint in [a, b). At the same time, 
b < z-\ = 2M and so b-a < 2M-2 and we can conclude that eo/{b-a) > 2+2/(2M-2) = ms, 
contradiction. However, 76 may not exist — this is the case iff 7r(6 — 1) = e. Again then 
necessarily b < z and we can turn to the vertex 6 + 1. If in Vc", the enforced pattern ell 
implies that (6 — 1,6 + 1) belongs to Eb and, counting this in place of 76 we get eo/(6 — a) > 
2 + 2/{2M — 2) as before. If 6 + 1 ^ Vc then joining [a, 6) to the next missing interval [6 + 1, 6') 
gives ei/(6' — a — 1) > 2 + 2/(6' — a — 1) on account of the edges (6 — 1, 6) and (6' — 1, 6'), and 
the desired inequality ei/(6' — a — 1) > m^ follows from the fact that b' — a < z — 2 = 2M — 1. 

If e G {vr(o — 1), ... , 7r(6 — 2)} we can only say that cq > 2(6 — a) since the contribution of 
the edge (6 — 1, 6) is negated by the shortage of one edge due to the e symbol. Nevertheless, if 
the extra edge 76 is such that 7 G [a, 6) then cq > 2(6 — a) + 1 and the argument used above 
implies that b — a > M and b < z — 1 = 2M. We now consider 6 + 1: it has a neighbor 7' 
with 7 < 7' < 6, in particular satisfying 7' G [a, 6). Thus, if 6 + 1 G Vc" we are done by having 
eo/(6 — a) > 2 + 2/(6 — a) > rriB- If 6 + 1 ^ Vc then together with the next missing interval we 
have ei/(6' — a — 1) > 2 + 2/(6' — a — 1) (accounting for the edges 76, (6 — 1, 6) and (6' — 1, 6') 
and subtracting one edge for the e symbol), and the fact that b' — a<z — 2 = 2M — 1 now 
gives the sought inequality ei/(6' — a — 1) > 2 + 1/(M — 1) = ruB- 

Finally, if there is an e symbol in {vr(a — 1), ... , 7r(6— 2)} and yet the extra edge 76 leaps over 
the interval [a, 6) then necessarily 7r(a — 1) = e (since having tt{j) = e makes j a cut-vertex) 
and 7 = a — 1. This can only occur when 6 = a + 1, and since neither of its neighbors can play 
the role of y (having a ^ Vc and 7r(7) = e) we can infer that b < z. The symbol e must be 
followed by 11 (in lieu of any additional e symbols in vr) and so the edge (a, 6 + 1) belongs to 
^Tt (and to B, since a 7^ y). Altogether, eo = 3 on account of the neighbors {a — 1, a+ 1, a + 2} 
of a and so eo/(6 — a) = 3 > niB- 

(c) vr contains 2 copies of e: Here Vt^ = M + 3 and e.^ = 2M, leading to \Vb \ Ib\ = M — 1 (now 
z = M + 2 and y > z - M = 2) and \Eb\ =2M -I. We get that 

2M-1 1 

"i-B = -TT ;- = 2 + 



M-1 M-1 

Let < s < 2 denote the number of e symbols in {vr(a — 1), ... , 7r(6 — 2)}. 

If s = then cq > 2(6 — a) + 1 and so 6 — a > M — 1. In this case either 7 G [a, 6) or the edge 
76 does not exist, either way precluding 6 from being equal to z, thus b—a < {z — l) — 2 = M — 1, 
contradiction. 

If s = 1 then a-priori we only have cq > 2(6 — a). If the extra edge 76 falls in [a, 6 — 2] then 
eo/(6 — a) > 2 + 1/(6 — a) and again 6 — a > M — 1, while at the same time b < z and so 
6 — a < M — 1. If the extra edge leaps over [a, 6) then necessarily 7r(a — 1) = e and 6 = a + 1. 
As before, here we must have 3 edges incident to a and so eo/(6 — a) = 3 > m^. If there is no 
extra edge 76 then 7r(6 — 1) = e and 6<z— 1 = M + 1, thus we can again appeal to 6 + 1 and 
either get an extra edge (6 — 1,6 + 1) from the pattern ell whenever 6 + 1 G Vc (leading to 
eo/(6 — a) > 2 + 1/(6 — a) > 2 + 1/(M — 1) = itib), or join [a, 6) with the next missing interval 
[6 + 1,6') and for an extra edge (6' -1,6') (leading to ei/(6'-a-l) > 2 + 1/(6'- a- 1) > ms). 
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It remains to treat s = 2, in wliich case clearly 7 G [a, b) and so b < z. If & + 1 G Vc 
then it has a neighbor 7' with 7 < 7' < & and we arrive at cq > 2(6 — a) + 1 (accounting for 
(6—1, 6), 76, (7', 6+1) while subtracting 2 for the e symbols). We have b—a < {z — l) — 2 = M—1 
and so eo/(6— a) > 2+l/(M— 1) = rriB, as needed. The final case has 6+1 ^ Vc, and combining 
{a, . . . , 6 — 1} with the next missing interval {6 + 1, . . . , 6' — 1} now gives ei > 2(6' — a — 1) + 1, 
where we counted the edges (6 — 1, 6), 76, (6' — 1, 6') and subtracted two edges for e symbols. We 
have b' — a<z — 2 = M and ei/(6' — a — 1) > 2 + 1/(M — 1) = ms, concluding the proof. ■ 

Let M > 3 be some arbitrarily large constant. We aim to establish control over the number of 
labeled copies of every M-bounded triangular ladder along the triangle removal process as long as 

p>PM:=n^'2 + Ji. (4.10) 



Recalling (4.2) observe that Sh{p) > n for any backward extension H as long as we have p > pM, 



with room to spare (more precisely, Sh{pm) ^ n^'"^ ^'^^ when the corresponding ladder ^^ has 



|7r| = M + 1 and is even larger when |7r| = 2M or |7r| = 3M — 1). Since C, as defined in (2.5) 
satisfies C,^^ = o{y/n), we have Sh > (^^^ for any t{p) with p < pM and any < (5 < 1. In 
particular, in this regime t < t~^{\/5) for any backward extension H, thus we may invoke Part ^ 
of Theorem 3.2 to control the variables ^ H,ip for all the extension graphs in Bm (each of which has 



at most L = 2>M + 1 vertices). Further note that at t{pM) we have C(t) = n i/*^+o(i)^ thus the 



decay rate of C satisfies Eq. (3.2) and we can conclude: 

Corollary 4.10. For every M > 3 there is some < 5{M) < 1 such that w.h.p. the following 
holds: For every backward extension graph B = B-j^^yz with vr G .if,r and any corresponding injection 
Lp we have \^B,ip — Sb\ < SbC o,s long as t < t-i, and p{t) > pu- 

Backward extensions are only defined for outer boundary edges. For all other edges (side bound- 
ary, initial or interior) we will need to keep track of the following. 

Definition 4.11 (forward extension). Let J£-^ he an M-bounded triangular ladder and let yz G 
E{^T^) with y < z be an edge that is not an outer boundary edge. The forward extension from 
yz to ^T^, denoted by F^^^yz, is the graph obtained by deleting from S£t^ all edges between vertices in 
the following set I of distinguished vertices: 

• // yz is an initial edge then I = {0, 1, . . . , y} U {z}. 

• // yz is a side boundary edge then I = {0, 1, . . . , y} U {z} in case 7r(y — 2) 7^ e and otherwise 
I = {0,l,...,y-2}U{y,z}. 

• If yz is an interior edge then I = {0, 1, . . . , z}. 

Similarly to the case of backward extension graphs, estimates to within a mu ltipl icative factor 
of 1 + 0(C ) over forward extension graphs can be readily derived from Theorem 3.2 



Corollary 4.12. Let M > 3 and let < 5 < 1 be the constant from Corollary 4-lC\ Then w.h.p. 
for every forward extension graph F = F^^yz with n G Bm o,nd any corresponding injection (p we 



have ^F,ip = (1 + 0{C, ))Sf as long as t < r* and p{t) > pM- Furthermore, when yz is not a side 
boundary edge the stronger bound ^F,ifi = (1 + 0{C))Sf holds throughout this regime. 

Proof. Suppose first that yz is an interior edge of Jfyr- We wish to count all possible mappings 
V' G ^F,ip by iteratively exposing the images of 2; + 1, ... , \V{.^Tt)\ in Vq- Following Definition 4.1 



of triangular ladders we see that every vertex j is attached to either 1 or 2 vertices among its 
predecessors {0, 1, . . . , j — 1} (depending on whether or not we then encounter the symbol e in vr). 
Therefore, given V'(l)) • • ■ > i^iJ ~ 1) foi' some j > z, the number of possibilities for ^l^{j) is dictated 



by either Yu for some u G {V'(l)) • • • ^i^U ~ 1)} or by Yu^v for some u,v in this set. Using (3.1) to 
control these variables up to a relative error of 1 + 0(C) and iterating this expansion we get 

*F,^ = (1 + 0{\Vi^^)\ C))SF = il + 0(0) Sf . (4.11) 
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This argument remains valid of course when yz is an initial edge with y = z — 1, since again we 
begin with the distinguished set / = {0, 1, . . . , z} whose images are predetermined by (p. On the 
other hand, if yz is an initial edge with y < z — \ then necessarily y = z — 2 (to see this observe 



that 7r(y) = e by Definition 4.7 and thus, since vr does not contain the substring elO, the potential 
forward neighbors of y are confined to the set {y + l,y + 2}) and similarly the neighbors of 2 — 1 in 
^7r are precisely y, z. Thus, the number of possibilities for ip{z — 1) is given by l"^(y)^(^) and from 



this point on we can iteratively expose the images of z + 1, . . . , |y(^,r)| as above to establish (4.11). 

It remains to deal with the case where yz is a side boundary edge, which as we next describe 

demonstrates the necessity for a careful analysis of /-fans. We focus on the case 7r(y — 2) 7^ e, noting 

that the case 7r{y — 2) = e will follow by a nearly identical argument. Recall that the distinguished 



vertex set is now / = {0, 1, . . . ,y} U {z}. By Definition 4.6, here we have y < z — 1, there is at 
most 1 occurrence of e in vr and the ladder ^n^^i contains an (M — l)-fan at y. Therefore, if we 
attempt to construct a copy ip S ^F,!fi iteratively as before, upon reaching the vertex 2; — 1 we will 
find that its neighbors in ^7^ are precisely {y, z — 2,z} and so, once the images of these vertices 
are preset, 'ip{z — 1) is given by some variable N^vw which we may very well have no control over 
(our approximation within a relative error of 1 + 0(C) for these variables ceases to be valid once 
p < n~^'^~^°^^' as the triple co- neighborhoods become merely poly- logarithmic in size). 

To remedy this issue we will rely on Theorem |3.2[ Let K be the subextension of F that has 
Vk = {0, 1, ... , z}. Since Ik = If = {0, 1, . . . , y} U {z}, each vertex j G {y + 2, . . . , z — 1} has 
{y,j — l,i + 1} as its neighbors in K. We can temporarily think of these edges as being directed 
by letting each such vertex j send one edge to j + 1 and another to y, while the vertex y + 1 
does the same and sends an additional edge to some i < y. With this view it becomes clear that 
ex = 2{vk — l-k) + ^ and furthermore that i^ is a (strictly) balanced extension graph. To conclude 
the proof, note that vk — I'K = z — y — 1 whereas z = y + M since z is the final vertex of an 



(M — l)-fan at y (see Eq. (4.1) bearing in mind that we are in the case 7r(y — 2) 7^ e). Plugging 
this into the expression for vk — i^K we obtain that Sk = n p^ . Hence, as long as p > pM 
we have Sk > n^^^~'^)l\^^'^> . This translates into a lower bound of at least n^'^ by the assumption 
M > 3 and, since C~^ = o{^/n), this implies in turn that t < t^{VS) for any < 5 < 1. We can 



therefore appeal to Part M of Theorem 3.2 and get that 



\^K,ip — Sk\ < SkC 



where 5 = 6{M) is precisely the same constant as in Corollary 4.10 From this point we can 
complete K into a copy of F by iteratively enumerating over the options for z + 1, . . . , \V{^Tr)\ as 
before, each time at a cost of a multiplicative factor of 1 + 0{C). Altogether we deduce that 

I^F,^ - Sf\ < (C' + 0{C))Sf = 0{C')Sf , 
as required. Essentially the same argument extends this estimate also to the case of a side boundary 



edge yz with7r(y— 2) = e. Now Ip = {0, 1, . . . ,y—2}D{y,z} yet z = y+M—1 (see (4.1)) and a closer 
look reveals that the vertices y — l,y + l,...,2; forming the (M — l)-fan play an identical role to that 
assumed by vertices y + 1, . . . , z in the previous case (e.g., each can be viewed as sending one edge 
to its successor in this list and another to y, the first vertex y — 1 sends one additional to i = y — 2 
and so on). Therefore, the subextension K is again strictly balanced and has ck = 2{vk — i-k) + 1- 
Having vk — I'K = z — y and 2; = y + M — Iwe again get Sk = n^'^p^^^'^ > n^'^ for all p > pM, 
and the rest of the argument holds unmodified. ■ 

5. Tracking triangular ladders 

5.1. Expected change. For a labeled undirected graph H and u,v £Vg write ^h,uv = ^H^uvit) 
to denote the random variable ^H,ip where (p maps 1— )■ n and 1 1— t- f . If H = ^^^ we will use the 
abbreviated form ^tt,™, and as usual we use f^ and e,r to denote |y(^,r)| and |i?(.if7r)|, respectively. 
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Our aim in this subsection is to estimate the expected change in variables ^n,uv within one step of 
the process for any M-bounded triangular ladder. 



V \[l ^ VTf 



Theorem 5.1. Let M > 3 and let < 6 < 1 be the constant from Corollaries 4-10 and 4-12. For 
any it G Bm 1st 7r~ denote its (|7r| — l)-prefix and define 

where ^0 ^^ := 1. Let C = n^^''^p^^ logn and for some absolute constant k > let 

T* = t*(k) = min{t : |Q/(|nV) - l| > «C'} • 

Let i be some time such that t{i) < Tq and p{i) > pM- Suppose that ^t^^uv = (1 + o{l))S-,^ holds at 
time i for all vr G Bm o-nd all u,v £ Vq- Then 

e O (C^+^] 

E[AX,,„, I Ti] = -(6 - o(l))^X,,™ + So(7r) + Hi(7r) + ^\ > S^ (5.1) 

for every vr G Bm o-nd every u,v £ Vg, where the correction term Hi(7r) is given by 

^ (^) = 6 + ^(1) ^^ J 

^^^ ' n^p^ 7roi,Mi) |(|7r|, |7r| + 1) is an initial/interior edge\ ' 

whereas if there exists y < \tt\ such that (y, |7r| + 1) G E{J^t^) then 

6 + 0(1) 

n^p 
6 + 0(1) 



^ ' ~ ~i^p '^'^ {(y, |7r| + 1) is an initial edgej 



fi3p3 TToO,uv i^y^ 1^1 _|_ -j^-j j^g ^^ interior edge^ 
6 + 0(1) S^ 



^■"^ St, _^^^^'^«-i°^1'™ ""{(y, |7r| + 1) zs a side boundary edge, 7r(y - 2) / e} 



6 + 0(1) 5^ 



^7I"u-20el,MtI -"- 



^V 5^,_2oe ^'"'°'^'™ {(y- Kl + 1) ^s « Side boundary edge, 7r{y - 2) = e} 
and otherwise Ho(vr) = 0. 

Our proof will crucially rely on the properties of triangular ladders in order to express the number 
of copies of ^TT that rest on a given edge in terms of the total number of other triangular ladders. To 
demonstrate this we use the following simple lemma of j2| whose proof is provided for completeness. 

Lemma 5.2. Suppose {xi)i,^[ and {yi)i<^i are real numbers so that \xi — x\ < 5i and \yi — y\ < 62 
for some x,y £M. and every i £ I. Then 



5^a;iy, -— (j^Xi)(^y, 



1^, 
iei iei iei 



<2\I\6i62. 



Proof. By the triangle inequality, |X]jg/(a;i — x){yi — y)\ < |/|(^i52, and at the same time it is easy 
to see that J2iei(^i ~ ^)iyi ~ v) ^^ precisely equal to 

The hypothesis on the Xj's and y^'s implies that the last term above is at most |/|(5i(52 in absolute 
value, from which the desired result follows. ■ 



Prior to commencing with the proof of Theorem 5.1 we wish to illustrate the role that the 
backward/forward extensions play in the analysis of the variables '^n,uv ■ 
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Example 5.3 (expected one-step change in ^ee)- In general, in order to estimate the one-step 
change in ^n,uv we examine the edges of the given ladder and for each one consider the probability 
of hitting this edge (as part of some labeled copy of the given ladder rooted at u,v £ Vg) by the 
current triangle as well as the number of copies of the ladder that it participates in. For ^gg we 
write separate summations for the two edges 12 and 23 to get 

a&Nu aeNu 

beNa\{u} 



■E 

aeNu 



+ ^ I n n I '-'ee ; 



Q Q \n^p 



i3n3 



where the indicators l{uaeE} treat the double counting of a triangle that contains both these edges, 
and the last equality rewrote their sum as 0{See/{n'^p^)) via the hypothesis on the degrees and Q. 



We now apply Lemma 5.2 with I = Nu, the sequence Xa = Yu^a with x = np^ and 6i = np'^0{C) 
and the sequence ya = Ya with y = np and 62 = npO{C) (the variables "^■k,uv count labeled graphs, 
thus for instance Ylai^N ^u,a = ^ei,™)- Since \I\5i52/Q = 0{C,'^p) while 5ee = n^p^ this gives 



YuQ Q n^p 



(The 0(^5--r)-term was absorbed by the 0(-T-)-term as the latter has order °f f by Def. (2.5). 



We stress that many of our applications of Lemma 5.2 have this common form. We are computing 
the expected change in some variable '^■k^uv due to the loss of a particular edge in the triangular 
ladder. We compute this as a sum over such edges yz G E{^££.,^'). For each one we break up 
the ladder ^^^ into a smaller subgraph K that contains n, v and the edge yz, then proceed to 
(i) count the number of labeled copies of iiT in G rooted at u, v (thereby yielding an estimate on the 
probability of removing such an edge in the current round) , denoted by some '^k,uv , and (ii) count 
the number of extensions from K to J^-j^ in G (giving the total number of copies of ^-j^ eliminated 
by removing the edge under consideration), denoted by some ^_^^/k- Observe that, by definition, 
J^ibe^ ' ^^n/K,i> = ^TT,™- Moreover, given the injection i/j : K ^- Vq the probability to remove 
ip{y)ip{z) is equal to ^^(j,)^(2)/Q, thus we arrive at the expression 



Applying Lemma 5.2 on this summation we now get a main term which is a product of ^n,ti! (the 
result of ^^ '^^T,/K,ip) ai^d '^K',uvi where K' is obtained from K by attaching to it a new vertex 
incident to the edge yz (the result of X^^ ^^(j;)^(2))- We now see the reason behind our definition 
of the family of triangular ladders, since if i(' is a triangular ladder then K' will commonly also 
belong to our family of tracked variables. As an error term in this analysis we will sustain the 
approximation error in l'^(j;)^(^)/Q, i.e., an 0(C/(^^p)) term, multiplied by ^k,uv and then by 
the approximation error for ^^^/x. Usually we will choose K so that J£-^IK will be the forward 
extension F-n-^y^ introduced in the previous section, whose corresponding variable ^F,if> was estimated 
there to within an error of 0(C,^Sp uniformly over the injection Lp. Overall, we get the desired 
0(C^ /(?T-^p))^7r,uD error-term. 

We are now ready to prove the main result of this subsection. 

Proof of Theorem 15. li By hypothesis we control the variables Yu = '^e,uv, Yu^v = ^i,uv and Q to 



within a multiplicative factor of 1 + 0{C,). Via Corollaries 4.10 and 4.12 this guarantees estimates to 
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within a factor of 1 + 0(C ) in backward and forward extension variables -B-tt.j/^ and F.,^,yz- (Forward 
extensions of initial/interior edges feature a better approximation of 1 + 0(C)-) 

As mentioned above, we will analyze the different possible edges yz G E{^tj) whose images in 
E{G) (under any mapping ^ G ^■k,uv) niay be removed in the upcoming step, and the possible 
impact this event would have on Xj^^uv 

Given Ti, for each tp G ^-k^uv we define the events 

1?^ = M {^SV^ ■ yz G £'(Jf,r)} where SY = {ip{y)'il^{z) is removed at time i} , (5-2) 

and for each edge yz G E{^.^) further set 

^yz^n,uv = — 2_^ -"-^F • 

According to these definitions, A^tt,™ = — Z]»/;e<i'^ „„ 1^^- ^^ ^^^ same time, we crucially have 

F{£^\Ti)= Yl n£l'\J'^) + 0{l/Q) (5.3) 

yzeE{Jf^) 

since P(<Sl^ , <f I ^ | Ti) < 1/Q for any distinct edges yz,y'z' G E{^-,^) (the event that both are 
removed identifies a single triangle in G), and there are O(e^) = 0(1) such pairs. It follows that 

(5.4) 

using the fact that Q x n^p^ whereas C~^n^p x n^p^ log n. Our ultimate goal in this theorem is 
to estimate the one-step change in X^r,™ = ^tt,™ — ('57r/5^-)*I',r-,MD- ^s usual we write the change 
from time i to time z + 1 as 

Since we have St^/Sj^- = np where = 1 if the last symbol in vr is e and otherwise 6 = 2, the 
change in the last term is deterministically given by 

^) = -«„/-. (4) (i - o(^)) *.-,„„ = _5izi«!)|^^^_ „„. 

^o^-/ \n^/ \ \n'^p/ J w^p o^- 

relying in the last equality on the fact that C,~^ = n^~°^ 'p"^ = o{n?'p). Adding the similar deter- 
ministic estimate (5,^/5^- )(i) = (1 — o{C,'^)){St^ / S^-){i — 1) we can rewrite the equation for /S.Xt^^uv 
as 

AX^,„, = A^^,„, - (1 + o{e))^A^^-,u. + ^^^^1^^.-,™ ■ (5.5) 

5^- n^p S^- 

Finally, bearing in mind that 9 counts the number of edges that are incident to the vertex |7r| + 1, 



we can break up the expected change in A$^^„t, and A$^-^^^ as per equation with (5.4) and get 



E[AX^,™ \F.-^= Y (^ [^yz^^,uv I H - (1 - o(C^))^E [Ayz^^-,uv I J'i]) 
y<z<\iT\ 

+ V fE[A M/ 1^1 , 6-o(a 5^ ^+^^ 

' / , I I'-^yZ^ TT,UV -^ jj ^ r- ^ TT^ .UV I ' O ^ TV ,UV ■ 

yz<=iE{X^) ^ F TT / F 

?/<z=|7r| + l 

(Notice that the first sum goes over all edges of ■^■^~, leaving Q edges of S(^7r) \ E{^^^-) for the 



second sum. This allows us to divide the last term of (5.5) to each of these latter summands.) 
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By assumption we have {ST^/S^-)'if^-^uy = (1 + o(l))^,r,ui) and so the o(C^) error terms above are 
easily absorbed in the last o(C^) term to give the following: 



yz£E{^^) \ ^ / 

y<z<\iT\ 

+ V (E[Ay,^^,uv\H + -^4^'^7r-uv]+%^'^n,uv (5.6) 

y<z=\TT\+l 

This identity becomes particularly useful in light of the fact that if yz G E{^^-) then yz has the 
same edge classification in both ^-j^ and ^t^- ■ (Indeed, the criterion for classifying yz as a boundary 
edge is a function of ttz-i and the values of y, z, whereas initial edges are identified by vr(y).) We 



now proceed to estimate the appropriate quantities in (5.6) for each yz S E{^t^) depending on the 
type of the edge under consideration. 

• Initial edges: Ifyz is an initial edge with y < z then 7r(y) = e and z £ {y + l,y + 2}. Consider the 
case z = y + 1 and examine vTy, the length-y prefix of vr. We will count the number of (labeled, 
rooted at u, v) copies of ^-i^y to estimate the number of edges that can play the role of yz in G. 
Given some mapping if £ "^nyMv, in the event that we remove the edge (p{y)ip{z) the number of 
affected copies V' £ ^tt,uv (i.e., those agreeing with ip on y,z) is simply the number of forward 
extensions F = F^^^yz from ip to the full copy of ^-,^. Thus, 

E[A * 1^-1-- V ^^(^^)^(^)^-^''^ 

For a fixed ip G ^TTy,uv, the probability of removing the image of yz is ^^(j;),^(z)/<3, whereas the 
sum over if G ^-Ky^uv of V^(j,)<^(z) is "^-Kyoi.uv We claim that vr^ o 1 g Bm- For y = 1 indeed 
el G Bm- For y > 1, clearly vr has either G^^ G {1,2} copies of e. Going back to the definition of 
Bm we see that, as vr G Bm-, necessarily |7r| < l where £ = 2M \i t-,^ = 1 and I = M + 1 \i itj^ = 2. 
Moreover, y 7^ £ so as to avoid 7r(£) = e, thus y < i and it follows that vTy o 1 has length at most 
i, at most G^vr copies of e and a last symbol of 1, thus vTy o 1 G Bm as claimed. 

As stated before, ^ ^F,i^ gives ^tt,uv itself. Altogether, an application of Lemma 



5.2 



gives 



^Try,uvW ^ P 

where in the notation of that lemma we substituted |/| = "J ^ny^uv , Si = 0{C,)/{n?'p) and 82 = 
0{C,)Sf uniformly over the injection p thanks to Corollary |4.12| (note that yz is not a side 
boundary edge), thus \I\5i52 = 0(CV(™'p)) E^g*.,,„„ ^f,^ = 0{^ /{n'p))^^,uv- 

Now consider the case of an initial edge yz where z = y + 2. If we wanted to use argument 
analogous to the one above we would examine the ladder ^t^^^^. However, in this case the 
sum over ip G ^7r,_i,™ of >"^(j;)^(2) would give ^,r,_ioO,™, and tt^.i o ^ Bm since ^{y) = e 
and the substrings elO and eO are forbidden. A closer look at ^■k^_^ reveals that the vertices 
y + l,2: = y + 2 play identical roles in this graph — they are connected to one another and each 
shares an edge with y. By switching their labels we can examine the graph J£-„y where the role 
of z will be played by the isomorphic y + 1. This gives 

'^i'-^yz^ -K ,uv I -' ij — / ^ ^ ) 

where pJ above stands for the composition of (/? and the map z 1— >• y + 1. One should note that 
the above equality used the fact that -^77,^2 has the distinguished vertex set {1, . . . , y} U {z} thus 



TV.UV 
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vertex y + 1 that was not mapped in ip' is indeed added to form a full copy of ^j^. Henceforth 
the analysis of this summation can be carried out exactly as in the above case of z = y + 1, and 
we conclude that Eq. (5.7) holds also for the case of z = y + 2. 

Having established (5.7) for all initial edges, we now wish to estimate the contribution to 
IE[AX I J^i] for such edges. This will depend on y, z as per the following three cases: 

(1) y < z < |7r|: From (5.7), applied both to '^■jr^uv and to ^^- ,^^, we have 

F f A \D I TTl _ _^F [A \I/ I -rl - ^^a°l."^ y , ^ (O ^t. 

''-^ L !/2^7r,ut) I -'t\ (-, ''-^ I'-^yz^ n~ ,uv I -'i] — t ^^^Tr,uv ~r 2 ^-rT,uv 

__6-o(l) ,^m^ (5 81 

where the last equality applied our hypothesis that ^o-,™ = (1 + o(l))5o- for any a £ Bm 
(namely for iTy and vr^ o 1). 

(2) y = |7r| and z = |7r| + 1: Here vr^ = vr, thus the main term in the right-hand-side of (5.7) 
simplifies into -^Troi,uv/Q- Writing ^■,roi,uv = -^^ttoI,™ + iS-,roi/ST,)^.^^uv we get 

--Av 6-0(1) , o(C^).,. .... 

^Zp ri'^p'^ n'^p 

(The final error term above absorbed a 1 + O(C^) factor due to the approximation of Q.) 

(3) y = |7r| — 1 and z = |7r| + 1: Here -Ky = vr^ and vr^ o 1 = vr (we must have 7r(y + 1) = 1 to 
allow such an edge yz). The identity ^tt.md = -'^tt,™ + {St^/S^-)'^^-^^^ now yields 

n^pS^- ^n-,uvQ n^pS^^ n-'p 

_ 12+0(1) ,o{e) 

where we applied the facts Q = (g + 0(C^))n^p^ and ^7r,™/^7r-,™ = (1 + o(l))np^. 
• Interior edges: First consider an interior edge yz where y < ^; < |vr|. Here we let -^w^^i be the 
intermediate graph containing yz. As before, for each ip G ^n^-i,uv the probability of choosing a 
triangle that contains ip{y)ip{z) is ^^(j;),^(z)/<3, and the number of extensions in ^tt.mv that are lost 
with this choice is the number of forward extensions from the fixed ip to F = F^^^yz- Therefore, 
as in the argument for initial edges we get 

EfA * 1^-1-- V ^^fa)V(^)^^-^ - ^-.-loby.,uv , 2K!l,J 



UV ) 



where 6j^2 = if y < z — 1 whereas 6^2 = 1 if y = -z — 1- Observe that vr^^i o byz G Bm by the 
characterization of yz as an interior edge, hence we have control over ^TT^-iobyz,uv and so 

E [Ayz^^,uv I -F,] - ^E [A,,M/,-,„, I Ti] = _ ^-^-i°V^^ ^ ^ ^lip_^ 

Now consider an interior edge yz with y < z = \7r\ + 1. Again let byz = Oify<z — 1 and 
byz = 1 ii y = z — 1. Since here tTz-i = vr there is no need to enumerate over extensions from an 
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intermediate graph to ^t^ and we simply have, due to our 1 + O(C^) control over Q, 



[A \T/ I TTl — '^Tmhy^^um _ -^Tvobyz,uv + np ^n,uv _ ^^ ^ ^^^2^^ "^^"^z.™ 6 + Q(C ) ,t, 

In particular, 

71 p »->7j-— Tl p 

w^p n'^p'^ " ' w^p 

Observe that this last equation provides a large self-correcting term, and yet we see that errors 
in X^oO.Mt) and X^^^i^uv will influence the error in Xt^^uv 

Side boundary edges with a standard fan: Let yz be a side boundary edge such that y < z and 
7r(y — 2) 7^ e (the case 7r(y — 2) = e, where the fan captured by the corresponding side boundary 
edge has an exceptional structure, will be treated separately). 

Recall that in this case there are Ctt S {0, 1} copies of e in vr and |7r| < 3M — 1 — (M — l)G^7r. 



Moreover, y > and z = y + M following Eq. (4.1 ), and z < 3M — 1 — (M — l)^,^ as otherwise 



yz would be classified as an outer boundary edge rather than a side boundary edge. From these 
facts we infer that %-i o el S Bm since this string contains 6^,^ + 1 copies of e, its last symbol is 
1 (as opposed to e) and it has length y + 1 = z - M + 1 < 3M - 1 - {M - 1){(L^ + 1). 

Similar to one of the arguments used for initial edges, we will let y+l in the graph .i?7rj,_ioe play 
the role of z. To be precise, for each cp £ ^j^ _ioe,uv let ip' be the composition of (/? and the map 
z I— )• y + 1. By the definition of the forward extension F = F^^^yz as having Ip = {1, . . . ,y}U {z}, 
the variable ^F,ifi' exactly captures the number of ways to extend ip' to a full copy of ^.j^, thus 



¥^£'J^7r„_;LOe,til! 



Q 



"y- 



Since E,^ ^7p(y)¥'fe+i) = ^7ry_ioei while ^i^,^/ = {1 + 0{C^))Sf for all ip' by Corollary 
conclude that 



4.12 



we can 



ElAy.'^^^uv I J^i] = -^ ^^^,™ + -^ — -'^n,uv ■ (5.13) 

We now turn to the effect this has on the one-step change in AX. When y < z < |7r| we have 

IE [Ayz'^TT,uv I J^i] — -^ IE [Ay2^^-^„„ I J^i\ = — -^— ^^ '—^Xt^^uv H 2 '^tt,uv 

_ 6-o(l) .OiC^ 



Alternatively, if y < z = |7r| -|- 1, since the main term of the right-hand-side of (5.13) equals 



^^.^loel,,.,; + V^7r,_ioe,«^ ,^, _ 6 + o(l) ^^,uv ^ , 6 + Q(C^) ,^, 

(the error terms above are due solely to the approximation of Q) it follows that 

FfA * IFI + A^vi, -_Ax 6 + 0(1) ^.,.. , Q(0 ... 

w^pS^- n-^p n-^p-^ *7ry_io6,™ n^P 

(5.15) 

• Side boundary edges with an exceptional fan: Let yz where y < z be a side boundary edge such 
that 7r(y — 2) = e. To tackle the expected change in this scenario we will modify the argument 
from the previous case to incorporate estimates on homomorphisms from ^■Ky_2oe and ^-Ky_2oei- 
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Recall that now there is precisely 1 occurrence of e in vr (at index y — 2) and so |7r| < 2M. 
We further have y > 3 and z = y + M — 1 following (|4.1l), and z < 2M so as not to make yz an 



outer boundary edge. Thus y = z — (M — 1) < A/ + 1 and we see that TTy-2 o el E Bm as it has 
2 copies of e and ends with a 1 at index y < M + 1 (of course %-2 o e G Bm as well). 

For each ip G '^■Ky-2oe let ^p' be the composition of (f and the map the sends y i— )• y — 1 and 
z >-^ y. By definition, the forward extension -Ftt.j/^ has Ip = {0, 1, . . . , y — 2} U {y, z} and it is 
easy to verify that as in the previous case (where 7r(y — 2) 7^ e) we have 



E[Ay,^^^UV 



J'i] 



E 



J^^(y-l)ip{y)^ 



F,ip' 



(/pe*. 



Q 



■y-2° 



The rest of the argument proceeds exactly as in the case 7r(y — 2) 7^ e with the single exception 
that TTy-2 replaces all occurrences of %-i. 
y < z < \Tr\, whereas in the situation oi y < z 



Overall we obtain that (5.14) is valid whenever 

= Ivrl + 1 we have 



E[A, 



yz^ -KjUV 



H + 



6 St, 



^, 



Ax 



6 + 0(1) ^^, 



-Xtt 



I 0(0 ,. 



n^pS,,- n'^p n-^p'^ ^7Ty^2oe,uv 

(5.16) 

Outer boundary edges: Let yz with z = |7r| + 1 be an outer boundary edge. By definition, 
extending ^vr by adding a new vertex incident to y, z leads us out of Bm, foiling the argument 
we used for the other edge classes. In lieu of counting mappings of subgraphs that include yz and 
then seeking estimates on their forward extensions to .Sfyr , we will first directly identify the images 
of y, z and then estimate the number of backward extensions to ^j,. Formally, let B = Bj^^yz be 
the backward extension from yz to ^7^ and for a given ordered pair (a, b) with ab G E{G) and 
{a, b} n {u, w} = let ip""^ be the injection that maps Oi— )-u, li— )'f, yi— )-a and 2; 1— )• 6. Then 

{a,b) : {a,b}r\{u,v}=d 
abeE{G) 

Summing Yab over the 2{\E{G)\ — Yu) ordered pairs (a, 6) as above gives 6Q — 4^ei,MD) as tri- 
angles containing u are only featured in two possible orderings in this sum. The corresponding 
summation over ^B,ip as usual gives ^n,uv, and now thanks to Corollary 4.10| we infer that 

o ie^') 






E[A^,^ 



-^,1 



(3Q - 2^ 



el.uv) ^■7T,UV 



+ 



n'^p 






6 

n'^p 



^n,uv ~r 



o {e+') 

ri^p 






{\E{G)\-Y^)Q 

where the fact that ^ei,™/Q and Yu/\E{G)\ are both 0(l/n) allowed us to omit the terms ^ei^uv 

"'"- = o(C^) '"j"" , and similarly, we replaced the quantity 



and Yu at an additive cost of O(^) 

|i?(G)| = \^{n'^p — n) simply by ^ri^p at an additive cost of O(^) 



^ t: ,uv 



o{e) 



2\ '^TT,UV 



It now follows that for any outer boundary edge yz. 



E [Ayz^^^uV 



:Fi] + 



6 St, 

rfip 5^- 



^ 



Ax 

n'^p 



+ ^ — ^^ 



n?p 



n^uv • 



(5.17) 



To conclude the proof, observe that every edge type contributes the term —{6 + o{1))-^Xt,^uv to 
the summation in (5.6), and correction terms accumulate when z = |7r| + 1. Outer boundary edges 

z-h 



do not contribute any additional terms. When y 
edges and interior edges contribute an extra term of 
edges cannot have y = z — 1 for any M > 3. When y < z 
-(6+0(1)) 



np- 



-^■K/ 



Eqs. (5. 9), (5. 12) reveal that both initial 

(6 + o(l))^^X^oi,™, whereas side boundary 

1, initial edges contribute the extra term 

as per (5.10), interior edges contribute the extra term —(6+o(l)) ^-3X7^01,™ as 



per (5.12) and side boundary edges contribute the term featured in (5.15). These correction terms 



match the definition of Hq, Hi in the statement of the theorem, thus completing the proof. 
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5.2. Concentration. In this subsection we prove the following theorem: 



Theorem 5.4. Let M > 3 and let Q < 6 < 1 he the constant from Corollaries 4-10 and 4.12. For 
any it in Bm l^t vr^ denote its (|7r| — l)-prefix and define 



TT ,UV •} 



where ^0 ^^ := 1. Let C, — "^ "'^ P ^ logn and for some absolute constant k > let 

r* = t*(k) = min{t : |Q/(|nV) - l| > ^C'} • 
Then w.h.p., for every it G Bm, every u,v €z Vq and all t such that t < t^ and p{t) > pM 

where uj-,^ = 33Ai-|7r|-(A/-i)t,r y^^^/j (T^ denoting the number of copies of e in vr. 

Proof. Let r be the stopping time where for some n G Bm and u,v G Vq we have |X7r,««| > lOt^CSt^. 
To show that w.h.p. we do not encounter f as long as t < r^ and p{t) > pM, consider a pair tt, uv 
where at some step io the variable |X,r,M»;| has just entered its critical interval 



h 



1 



4e, 



We define 

and will now argue that Zt^^uv is a supermartingale as long as IXtt^^^I G 7,^ and in addition we 
have not yet encountered r A r^ and are above the fin al p robability threshold pM- Indeed, since 



A(C50 



6+o(l) 



(ctt — l)C57r, in this setting Theorem 5.1 implies that 



E[AZ^,, 



-Tr^ / 6 + o(l) 



n^p 



1)uj.„CSt, 



6 + 0(1) 3 
n'^p 






-1 H n Ovr 



n'^p 



^1 ,-, o{c 

-ol + Ril H 5— 



-O71 



(5.18) 



Examining the definitions of Ho,Hi from that theorem, we see that the contribution of Hq when 



the mentioned edge (y, |7r| + 1) is an initial edge translates in our setting to 



6+0(1) 

n'-^p 



X7T,uv\, strictly 



negative for large enough n. As we look for an upper bound on Hq + Hi, it is thus legitimate to 
ignore this case. However, in all other cases with these correction terms, we have no guarantee that 
Xa,uv for 0" / TT shares the same sign as Xtt^uv Worst case estimates for these variables (recalling 
the bounds on their absolute values, available throughout t < f) give 



< 



6 + 0(1) 



^nolC'S' 



TTOl 



6 + 0(1) 



n^p 



"^TTOlC'^Tr ) 



and similarly. 






< < 



if ^TT has a side boundary edge yz with z = |7r| + l, 7r(y — 2) ^ e 
if .^fjr has a side boundary edge yz with z = |7r| + l, 7r(y — 2) = e 
if Jfir has an interior edge yz with y < z — \ and 2; = |7r| + 1 
Returning to ( |5.18 ), when ^^r does not contain a side boundary edge yz with 2: = |7r| + 1, 

6 + 0(1) ,. 



;i + o(l))w^^_2oel 
^l + o(l))tJ^oO 



E[AZ^, 



Ti]< 



ri^p 



lOJ-TT — CJjroO 



WttoI) i,<->n -\ o On 



v?p 



1/2 + 0(1) ^0^0. 



v?p 



n^p 
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where we set cjg- = if o" Bm and the equahty stemmed from the fact that for any b G {0, 1}, 
since vr and nob share the same number of e symbols, our definition of the constants (a;(j)o-eBjv/ 
imphes that cj^ob = ^Wtt- 

Similarly, when yz with z = |7r| + 1 is a side boundary edge of ^t^ we can let r = y — 1 if 
7r(y — 2) 7^ e and r = y — 2 otherwise, and obtain that 

By definition of side boundary edges, vr has at most a single copy of e, thus uj^ = 33*^-KI-(a^-i)<^t 
and w^„3i = 33M-(r+2)-(M-i)(e:,+i) ^ ^2M-i-r-{M-i)e^ _ Furthermore, revisiting (|4l]) we have 
y = z-M = |7r|+l-Mif7r(y-2) / e (and then r = |7r|-M) while y = z-{M-l) = \\pi\-M + 2ii 
7r(y — 2) = e (and then again r = \it\ —M). In both cases we therefore get that t<;^^„^^ = ^Wtt = w,roi 
and so again we obtain that 

w^p n'^p 

In particular, E[AZ',r,™ | ^i] < for large enough n, as claimed. 

We are left with the task of bounding the one-step changes AZ given Ti in L°° and L^ norm. 
Revisiting the events £^ and £^^ defined in (5.2), observe that when aiming for an upper bound 



on |A^7r,u?;| (as opposed to tight asymptotics, where (5.3) incurred an approximation error) one 
always has 






In particular, for the sake of an upper bound on |AZ| and E[(AZ)^ | Fi] it will suffice to analyze 
the effect of each edge yz E E{^t^) separately. 

Let yz G E{^t^) with y < z and let H = {V{^t^),E{^t^)) be the extension graph featuring 



Ih = {0, 1}. As argued in the proof of Theorem 3.2, let T = T{yz) C H he the subextension of 



H with minimal scaling out of all subextensions that contain the edge yz (and, if multiple such 
subextensions exist, with minimal cardinality out of those). 

Consider some subextension of the quotient H/T. This graph can be written as K/T for some 
subextension K C H that contains all edges of T, thus Sk > St by the minimality of T, and 
equivalently Sx/t ^ 1- Part M of Corollary |3.3| now implies that for some ci > and any 
injection ip :T ^ Vq we have 

^H/T,^<SH/T{^ognrK (5.19) 

Notice that at the same time any subextension K <Z H has Sk > 1- This follows from the fact that, 
by Definition |4.1[ every vertex 1 < i < |7r| + 1 shares at most 2 edges with the vertices {j : j < i}, 
thus ck < 2{vk — i^k) and Sk > n^'"K~i-K)/M > ^ ^^^ ^^ ^j^g hypothesis p > pM- Since every K <ZT 
has Sk > 1 (being also a subextension of H) we may again appeal to Part (jTl) of Corollary 3.3 
obtain that for some C2 > and any u,v £ Vg we have 



to 



■^T,uv<ST{\ognY^. (5.20) 

A final ingredient we need is to control the number of extensions to a copy of T given the image of 
the edge yz. Formally, let To be the edgeless graph on the vertex set {0, 1,2/, -z} and consider the 
quotient T* = T/Tq (that is, the extension graph obtained from T by increasing the distinguished 
vertex set to {0,l,y, z} and removing any edges among these vertices). The choice of T implies 
that every K* C T* has Sk* > St* (as otherwise the result of moving the vertices y, z from Ik 
to Vk \ Ik and adding all edges of T among {0, 1, y, z} would yield a graph K with Sk < St and 
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a strictly smaller vertex set). Equivalently, Sx*/k* ^ 1 for all K* C T* and so, by Part (pi) of 



Corollary 3.3, we deduce that for some C3 > and any injection rj : {0, 1, y, z} — )■ Vg, 

^T/To,, <aognr- (5.21) 

We now translate these facts to L°° and L? bounds on IS.yzXT^^uv given J^j. The analysis will differ 
depending on whether or not y G {0, 1}. 

(i) In case y > I: Since T contains the edge yz we have St > n^p'^ already due to the vertices y, z 
(as argued above, each vertex i > 1 that belongs to Vr sends at most two edges backwards). 

For the L°° bound we look at the maximum number of extensions '0 € ^-k^uv all of which 
map yz to a particular edge in E{G). Equivalently, we are examining the maximum possible 
value of '^H/To,ri over all injections rj : V(Tq) — )• Vg that map — )• u and 1 — )■ f with the 
additional property of preserving all edges of the induced graph of H on {0, l,y,z}. Each 



such extension ip factors into extensions Tq "^^ T and T ^^ H, thus an application of (5.21) 



and (5.19) gives 

\Ayz^^^uv\ < max^H/To,v ^ ™^^ Y^ '^H/T,^ < '5H/T(logn)^i+'=3 ^ (522) 

Since Sh/t = Sh/St and St > n^p^ we further have 

(lo£n)^i+^3 

'7 ftp 

For the L^ bound, observe that the set of edges in E{G) that can play the role of yz for 



some copy of H rooted at uv is clearly contained in {ij){y)%p{z) : ip G ^t,™}- Using (5.20), the 
probability that such an edge is chosen in the next round is therefore at most 

y ^%m<(6 + o(i))M^. 



By (5.22), combined with the usual fact that Sj^/t = Sh/St, it follows that 



2 , ^1 / .. , ,,,J5^)2(logn)-2+2('=i+'=3) {SH)H^ognf 



E[{Ay,-^^^uvr I J'i] < (6 + 0(1)) ^ c , < 



otti'^P n^p 

where the last inequality featured some suitably large constant C > and relied on the fact 
that St > n^p'^. Integrating this over a maximum of n^p remaining steps gives 

i>io i>io -f V >J/ 

(ii) In case y E {0, 1} and 7r(l) = 1: 

• Suppose z = 2. Here V{T) = {0,1,2} and E{T) = {02,12} (any additional vertex would 
increase St)- The number of the number of extensions tp G ^n,uv which map yz to a 
prescribed edge in E{G) is exactly the number of forward extensions from yz to tt. Since 

gives 



4.12 



F = i^7r,j/z has V{T) as its distinguished vertices, Sp = Sn/inp^) and Corollary 

Ay,^.,uv = 0(5h/(V)) • (5.25) 

We claim that the probability of choosing a triangle incident to yz is 0{p/n). Indeed, if 
y = then we are looking at triangles incident to u and to some neighbor of v (to assume 
the role of the image of z), and there are A^^,^ = 0{np^) candidates for the image of z each 
sharing 0{np'^) triangles with u. The same holds symmetrically if y = 1 and we deduce that 

Ee[(A..*„.)^ I ^.1 < E o(^ Ifj) = O(^) . (5.20) 

i>io i>io \ i-v u; / 
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Suppose z > 3. Here necessarily y = 1 and either vr^ = 110 . . . or vr^ = 11 (since 7r(2) ^ 0). 
Note that if V{T) = {0, 1,2, ... ,z} then St > n'^p'^ already due to two backward edges sent 
from 2 and two such edges from z. Thus, the argument for y > 1 (Case (m)) applies in this 
case as well and implies the bounds in (|5.23|) and (5.24) for the L°° and Lr norms, resp. 



Otherwise, we must have V{T) = {0, l,z} and E{T) = {Iz}. (Indeed, including any w > z 
in V{T) would clearly increase St- If y{T) Ci A C. A for A = {2, . . . ,z — 1} then of course 
z > 3 and so tt^ = 110 ... 0. Thus, the vertex with minimal index in V{T) n A sends 2 edges 
backwards and its removal from T would decrease St, contradicting the minimality of T.) 

For the L°° bound, we can control the number of forward extensions i*V,j/2 from yz to vr via 



Corollary 4.12 If yz happens to be a side boundary edge (in our setting this will happen iff 
z concludes an (M — l)-fan at vertex 1) then Ip = V{T) = {0, 1, z} thus Sp = Sn/inp) and 

For all other edge- types Ip = {0, 1, . . . ,z} and we must examine the number of extensions 
from V{T) = {0, 1, z} to the induced graph of H on Ip. Specifically, Let R be the extension 
graph of ^TT^ with the distinguished vertex set {0, 1, z}. Recalling that vr^ = 11 ... 0, there 
are z — 2 vertices and 2{z — 1) — 1 = 22: — 3 edges in R. Observe that R is strictly balanced, 
since we can view any vertex 1 < j < z is accountable for two exclusive directed edges going 
from it to 1 and j + 1 (the maximal density is attained when all such j's are present). Finally, 
•Sr > 1 as otherwise adding the vertices to {2, . . . , z — 1} to V{T) would give an extension 
graph containing yz with a scaling of StSr < St, violating the definition of T. That is, 

As before, we count extensions ■0 ^ ^tt,™ which map yz to a prescribed edge e = vw via 
the product of "^r^uvw and max^ ^ F-n,yz,v "^^^ latter is asymptotically Sp = Sn/inpSpi) 



by Corollary 4.12 For the former we apply Theorem 3.2, yielding an upper bound of 
(1 + o{\))Spi as long as i < t~^{-s/5) and the upper bound Sr + 5^~ (logn)^ for some fixed 
/O = PiJ > for t^{^/~5)t < Ir. We have Sr > 1, hence indeed t < tR and the combined 
upper bound of Sr + uiax{SR, Sj^ (logn)^} is a valid one for ^r^uvw Moreover, since Sr 
is decreasing we see that this bound is 2Sr up to the first point where Sr < (logn)^'^ 
(namely, p = n~2z-3 °^ •' < f^-2/3+o(i) ^^^ ^,^y z > 3). Beyond that point we can simply use 

-V 

s 

npSR \ np^l^ , 

where the last inequality used the aforementioned fact that the term log^n can dominate 
Sr only starting at p < n~^''^"^°'^\ whilst it is then easily absorbed by the extra factor of 



the bound 2(logn)'^ for c = (1 + -*- ^ )p- ( Altogether, 

Aj,,*.,™ < (2 + o(l)) max{5fi , log^ n} -^ = O ( -^ ) , (5.27) 



p ^'^. The L°° bound in (5.27) is of larger order then the OiSnlinp)) bound given for the 



case where yz is a side boundary edge, thus we can safely use it for any edge-type. 

For the 1? bound, note that the probability of selecting a triangle incident to v (supporting 
the removal of yz) is X^^gjv ^vw/Q = 0(l/n). Summing the one-step expected variances 
in the usual way yields 

gE[(A,.*_)^ I H < i:o{^) = O(^) . (5.28) 

(iii) In case 7r(l) = e and y = 1: By definition yz is an initial edge, and since the corresponding 



forward extension -Ftt.j/^ has Ip = {0, l,z} we can invoke Corollary 4.12 and infer that 



Ay,^^,uv < max^i.^^^^,^ = OiSp) = OiSn/inp)) (5.29) 
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and (again by the fact that the probabUity of choosing a triangle incident to v is 0(l/n)) 

J:e[(A,.*,„)'^ I ^.1 < Eo(sSw) £ o(-^) . (5.30) 



1>J0 



1>J0 



We now combine the bounds above for analogous estimates on AZ. Comparing (5.23), (5.25) 



(5.27) and (5.29) we see that for all vr and all edges yz in ^^ we have 






(5.31) 



whereas a comparison of ( 5.24[ ), ( |5.26 ), (5.28) and (5.30) shows that similarly we always have 

Y, n{\z'i'n,uvf \J'i]<0 (^] . (5.32) 

i>io \ P / 

By summing these expressions over all yz £ E{H) we arrive at the corresponding upper bounds on 
A^TT.iii) (of the same order). Furthermore, if z < |7r| we may repeat this analysis for ^t^-j ^^^ since 
the obtained bounds will be in terms of S^- while the variable Xt^^uv features (5^/5^- )\I'^-,^^ we 
deduce that these bounds extend the aforementioned bounds to Xt^^uv itself. It is furthermore easy 
to verify that the change in Z due to the term (St^, changing deterministic with p by an additive 
term of relative order 0(l/(n^p)), is negligible in comparison with these bounds. (Indeed, it is a 
factor of n/{pC,) smaller than our Lipschitz bound.) Altogether, the L°° and cumulative LP' bounds 
hold for the supermartingale Z and we can now exploit these to obtain a large deviation estimate. 

Observe that the L°° bound in (5.31 ) is o{C,Sh) since ■n}''^p tends to infinity (it is at least as large 
as "n}' ). Thus, the first step of Z upon which |X7r,ni)| enters the critical interval I-,^ is negligible in 
terms of the order of this window and we have Z(zo) = (— l/(4e7r) + o(l))a;7rC'57r. 

Set s = (5e7r)~^a;,rC'57r as the target deviation for our supermartingale Z. The product of the 
uniform L°^ bound and s has order ^(5^)^/(np^). This is of lower order compared to the cumulative 



L^-bound in (5.32) since C ~^ given the fact that np^ > n^'*^ throughout the range p > pM- 



We may now invoke Freedman's inequality and get that for some fixed c > 0, 



3t > to : Z{t A r^ A r) > Z{to) + s] < exp 



S^/{np'^ 



-cnp ( 



-c(logn)^ 



Having arrived at an error probability that decays to at a super-polynomially rate, a union bound 
over the ensemble of {"^-k^uv) variables establishes the required estimates for all n G Bm- * 



6. Lower bound on the final number of edges 

Our lower bound of n^' ^""^^^ on the final number of edges will rely solely on the fact that for 
any fixed e > we have asymptotically tight estimates for all the co-degrees in the process at time 
t such that p{t) = n~^/^+^. 

Theorem 6.1. Suppose that for some fixed < e < g, all co-degrees satisfy Y^^v = (1 + o{\))np^ 
throughout p > po = n"^'^"*"^. Then w.h.p. the final number of edges is at least 7T,3/2-6e-o(i)_ 

Remark 6.2. The above result was formulated in accordance with our upper bound analysis, 
which achieves the required estimate over the co-degrees as long as the number of e dges is of order 
j.^3/2+£ £qj. g^j^ arbitrarily small fixed e > 0. The proof argument of Theorem 6.1 in fact yields a 
stronger result, namely that an asymptotic upper bound of np'^ over all co-degrees throughout the 
point of n^i'^h edges, for some h{n) < n^'^ that grows to oo with n, implies that the final number 
of edges is w.h.p. at least cn^''^h~^ for some absolute c > 0. 
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An important ingredient in the proof is the following straightforward lemma which provides an 
upper bound on Q valid well beyond the point where our co-degree estimates break down. An 
estimate of this nature, albeit weaker, was provided in i4j. Towards the end of the process, what 
begins as a second order term in this bound will become the main term, hence the importance of 
estimating it sharply. 

Lemma 6.3. Let Tc denote the minimal tim,e where Y^^v > 2[np'^ + n^'^) for some n, w G Vq- With 
high probability, for all t < Tc such that p{t) > n~^'^ , the number of triangles satisfies 

Q-\ [n^P^ + n^p) < n'/'^p^ . 

Proof. By a slight abuse of notation we let Q denote both the collection of all triangles in G and 
its cardinality. By definition, 

uvwGQ uvGE 

Since '^uv&E^i^,'" ~ ^Q' ^'PPlyi^S Cauchy-Schwarz shows that J2uv€E^uv — 9*3^/1-^1 ^^^ thus 

|-c/| ^{n''p — n) n'^p 

Define 

T = - (n V + n^p) + n^^^P^ 
and observe that the one-step change in this variable is deterministically given by 

A(T) = -(l + o(-\-y\\ C^n^p^ +^n^ + 2rJl^p\ = - Unp^ + 1 + I2n^l^p\ +0{p/n). 

Therefore, if we set 

Z = Q-T , Iq=[T- in^/V , T] 
it follows that whenever Q € /q we have 

IE[AZ \Fi]<2- (Snp'^ + 3 + Ibn^^^p) + (snp'^ + 1 + 12n^/^p) + 0{p/n) = -Sn^^^p + 0{p/n) < , 



where the last inequality holds for any sufficiently large n. 

Let io be the first time where Q £ Iq and write to = ^(^) and po = p{io)- For any to < Tc 

\AQ\ < 3maxYu,v = 0{npl + n^/^) , 

u,v 

and observing that this bound is of order strictly lower than n'^' ^Pq for all po ^ I/'t- we obtain that 

Z{i,) = -{\-o{l))rJlV,. 

From this point the process has at most v?pq/2 steps until its conclusion, therefore we may substi- 
tute s = gn^/^pQ and deduce from Hoeffding's inequality that for some fixed c > 

/ S^ \ / 77^/3^3 \ 

P(3i > io : Z{i) > Z{io) + s)< exp -c^^-^^— —- = exp ' " ^^ ' 



2 



If Po ^ rT^'^ then the last exponent has order n^'^/pQ > ri^'^, and otherwise it has order n^pj] > 
j^i/5 £qj, g^j^y ^^ > -r"^'^. A uulou bouud over io now completes the proof. ■ 
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Proof of Theorem 16. 1[ Fix < e < g and let zq be such that 

Po = p{io) = n-^l'^^^ . 

Condition on Ti^ and assume that up to this point we had Yu^v = (1 + o(l))n^p for all u,v £ Vq- 

For two distinct triangles uvw, xyz £ Q we write uvw ~ xyz to denote that they share an edge. 
Further denote the number of distinct triangles incident to some uvw G Q by 

Buvw = #{xyz £ Q : xyz ~ uvw} = Yu,v + i^«,«) + yv,w - 3 . 

Observe that our hypothesis on the co-degrees implies that Buvw = v?'^^"^^' for all Q triangles at 
time iQ. In addition, we claim that if uvw G Q{j — 1) then at the end of round j either uvw ^ Q{j) 
or Buvwij — 1) — Buvwij) < 4, and in the latter case the co-degree of at most 2 edges in uvw has 
changed. Indeed, if the triangle selected in round j is either uvw itself or some triangle xyz ~ uvw 
then at once uvw ^ Q{j), whereas removing a vertex disjoint triangle xyz does not affect Buvw 
Consider therefore the case of removing a triangle xyz sharing exactly one vertex in common with 
uvw, that is, w.l.o.g. x = u and {y, z} n {v, w} = 0. In this case, each of the edges uy, uz may 
participate in at most one triangle involving the edge uv and at most another involving the edge 
uw, thus at most 4 triangles can be deducted from the count of Buvw in this case whereas Yv^w 
remains unchanged. (This last scenario is tight whenever both y and z belong to Nu^v H Nu^w) 

Next we will consider a random subset ^ of all triangles in Q{io)- Initialize this set by ^{io) = 
and let it evolve until the culmination of the triangle removal process, denoting its final value by 
^i,. To formulate its evolution rule we further maintain an auxiliary set ^, also initialized by 
^(io) = 0- At round j > Iq do as follows: 

• Process the triangles Q{j) in an arbitrary order. 

• If the current triangle uvw G Q{j) has uvw ^ ^ and in addition this is the first round j after 
which some edge uv in this triangle has no other triangles incident to it (1^,^ = 1) then: 

— Add uvw to ^ . 

— Add all triangles in J\f2{uvw) to ^, where J\fi{uvw) = {xyz : xyz ~ uvw} denotes all 
triangles incident to uvw and J\f2{uvw) = J\fi{uvw) U {x'y'z' : x'y'z' ~ xyz G J\fi{uvw)}. 

(Note that at no point do we ever delete triangles from J^T or from ^ .) As explained above, 
removing some triangle xyz can affect triangles incident to at most two edges of uvw. Therefore, 
if j is the first round at the end of which uvw has some edge with co-degree 1 , and uvw is intact 
at the end of this round, then at least one of its other edges has co-degree greater than 1 and so 
Buvw > upon the insertion of uvw to ^ . 

With the above explanation in mind, consider some time ii > Iq. At time io, every triangle 
uvw G Q{io) had all of its co-degrees equal n^'^'^"^^' > 1. If uvw G Q{ii) has Buvw = then 
either uvw G .^ or necessarily there exists some xyz G J^T such that at some point iq < j < ii 
we had uvw G J\f2{xyz). By the co-degrees hypothesis we know that already at time io we had 
\Af2{xyz)\ < n'^^+°(^) for all xyz G Q{io), and since co-degrees only decrease throughout our process, 
we can conclude that 

|^(n)| > I {uvw G Q{ii) : Buvw = 0} I n-^^-°(i) . (6.1) 

Let ii be such that pi = p{ii) = ^^^^^ ^ . At this point \E\ = ^{ri^pi —n) = (^ + o(l))n'^'^(logn)~-^ 

while n^pf = 0(n'^'^(logn)^'^). Obviously, if the process concludes before time ii we have at least 
[-^(ii)! = n'^''^~"^^> edges in the final graph and there is nothing left to prove. Assume therefore that 



the process is still active at time ii. By Lemma 6.3 (noting that our assumption on the co-degrees 



satisfies the hypothesis t{ii) < Tc in that lemma for any sufficiently large n since e < g) we get 

Q{k) < |n3/2(logn)-3 + in3/2(logn)-i + n'/^ = (^ + o{l))\E{h)\ . (6.2) 

Clearly, if {uv G E{ii) : Yu,v = 0} > 5|i?(ii)| for some arbitrarily small fixed 5 > 0, that is, a 
positive fraction of the edges have no triangles incident to them, then at least 5\E{ii)\ = n^/'^~°y^i 
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edges will survive the removal process and we can conclude the proof. Assume therefore that there 
are at most o{\E{ii)\) such edges. In particular, 

Qih) = l Yl Yu,,>l\{uveE{h):Yu,yy^O}\>{l-o{l))\E{ii)\. 

uvGE(ii) 



Furthermore, together with ( |6.2[ ), this implies that almost all triangles are edge disjoint. A simple 
way to see this is to order the Q{ii) triangles arbitrarily and then sequentially add their edges to 
the initial edge set {uv £ E{ii) : Y^^v = 0}, eventually arriving at the edge set of G{ii). If for 
some fixed 5 > at least 6Q{ii) triangles share any edges with the triangles preceding them in 
this procedure then the total number of edges would be at most (3 — 6)Q{ii) + o{\E{ii)\), which 



by (6.2) is at most (1 — o + o(l))|-E'(ii)|, contradiction. 



3 

Examining the above statement we see that one can exclude o(|£'(ii)|) triangles and be left with 
a set ^ of (^ — o{l))\E{ii)\ edge disjoint triangles, i.e., Buvw = for every uvw G A. Re-adding 
the excluded triangles may now increase the value of 3^^^ for at most o{\E{ii)\) distinct triangles 
uvw £ A (an excluded triangle impacts at most 3 triangles in A). Altogether, 

# {uvw G Qih) : Buvn. = 0} = (i - o(l))|^(ii)| = n^/^-^Ci) , 



which when added to (6.1) now implies that 

\^A > l^(n)| > n3/2-4^-°W . (6.3) 

We now focus our attention on which edges among the triangles of ,^ belongs to the final 
outcome of the process. To this end, it will be useful to consider an equivalent formulation of 
the process starting from time io. Instead of drawing a uniform triangle in every step, consider a 
uniform permutation a over the triangles Q{io), and the deterministic process that examines these 
triangles sequentially and each time removes the 3 edges of the triangle under consideration iff it is 
still intact (i.e., iff all of its edges still belong to the current graph). Clearly this process is merely 
a time-rescaled version of the usual triangle removal process, hence it suffices to analyze this one 
for establishing some properties of the final output. 

For each uvw £ X define an event Euvw which will depend only on the internal ordering of the 
triangles N2{uvw) in a. The formulation of the event will depend on the constellation of triangles 
incident to uvw upon its addition to <^. Let j denote the round at the end of which uvw was 
added to .^. By definition, at the end of this round there was some edge of uvw that had no other 
triangles incident to it, that is, w.l.o.g. we had Yu^v = 1- Choose some arbitrary xyz G Q{j) which 
at the end of round j satisfied xyz £ Mi{uvw), while recalling that by construction Buvwij) > 
thus such a triangle necessarily exists. Define Suvw to be the event that a arranges xyz prior to all 
triangles Mi{xyz) (including uvw). 

First, recall that for any xyz as above J\fi{xyz) < ri^'^+°W by the co-degree hypothesis, thus 
^{Suvw) > n^'^'^^°^^K Second, observe that on the event Suvw the triangle xyz under consideration 
will be examined for possible removal prior to examining any other x'y'z' ~ xyz from Q{j), thus the 
process will delete the edges {xy, xz, yz} at that time. Only later will uvw £ Mi{xyz) be processed, 
and at that time no action will be taken due to its missing edge (the edge in common with xyz). 
In particular, the edge uv with Y^^vij) = 1, whose only incident triangle at time j was uvw, will 
belong to the final output. 

Since the triangles in ,^ are edge disjoint, the final graph contains at least ^ '^Euvw edges. 
Moreover, the occurrence of Suvw is purely a function of the internal ordering according to a 
between xyz £ J\fi{uvw) and J\fi{xyz) C J\f2{uvw). By construction, we exclude all triangles in 
M2{uvw) from possible future inclusion in ,^ (by adding them to ^), thus ^ '^Suvw stochastically 
dominates a binomial random variable with parameters Bin( I c^ I, n"^^""^^^). By ( |6.3[ ) and standard 
estimates for the binomial distribution this is at least 7),3/2-6e-o(i) -^^j^.p,^ a,s required. ■ 
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